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| Preface 



This is Part I - Fundamentals of Applied Discrete Structures, containing most of the fundamental concepts taught in a one semester course 
in discrete mathematics. It corresponds with the content of Discrete Structures I at UMass Lowell, which is a required course for students 
in Computer Science, Mathematics and Information Technology. Part II - Applied Abstract Algebra would normally be used in a second 
semester course such as Discrete Structures II at UMass Lowell. 

Twenty-one years after the publication of the 2 nd edition of Applied Discrete Structures for Computer Science, in 1989 the publishing and 
computing landscape have both changed dramatically. We signed a contract for the second edition with Science Research Associates but by the 
time the book was ready to print, SRA had been sold to MacMillan. Soon after, the rights had been passed on to Pearson Education, Inc. In 
2010, the long-term future of printed textbooks is uncertain. In the meantime, textbook prices (both printed and e-books) have increased and a 
growing open source textbook market movement has started. One of our objectives in revisiting this text is to make it available to our students 
in an affordable format. In its original form, the text was peer-reviewed and was adopted for use at several universities throughout the country. 
For this reason, we see Applied Discrete Structures as not only an inexpensive alternative, but a high quality alternative. 

As indicated above the computing landscape is very different from the 1980's and accounts for the most significant changes in the text. One of 
the most common programming languages of the 1980's, Pascal; and we used it to illustrate many of the concepts in the text. Although it isn't 
totally dead, Pascal is far from the mainstream of computing in the 21 st century. In 1989, Mathematica had been out for less than a year — 
now a major force in scientific computing. The open source software movement also started in the 1980's and in 2005, the first version of 
Sage, an open-source alternative to Mathematica was first released. In Applied Discrete Structures we have replaced "Pascal Notes" with 
"Mathematica Notes" and "Sage Notes." Finally, 1989 was the year that World Wide Web was invented by Tim Berners-Lee. There wasn't a 

single www in the 2 nd edition. In this version, we intend to make use of extensive web resources, including video demonstrations. 

We would like to thank Tony Penta, Sitansu Mittra, and Dan Klain for using the preliminary versions of Applied Discrete Structures. The 
corrections and input they provided was appreciated. 

We repeat the preface to Applied Discrete Structures for Computer Science below. Plans for the instructor's guide, which is mentioned in the 
preface are uncertain at this time. 



I Preface to Applied Discrete Structures for Computer Science, 2nd Ed. 



We feel proud and fortunate that most authorities, including MAA and ACM, have settled on a discrete mathematics syllabus that is virtually 
identical to the contents of the first edition of Applied Discrete Structures for Computer Science. For that reason, very few topical changes 
needed to be made in this new edition, and the order of topics is almost unchanged. The main change is the addition of a large number of 
exercises at all levels. We have "fine-tuned" the contents by expanding the preliminary coverage of sets and combinatorics, and we have added 
a discussion of binary integer representation. We have also added an introduction including several examples, to provide motivation for those 
students who may find it reassuring to know that mathematics has "real" applications. "Appendix B — Introduction to Algorithms," has also 
been added to make the text more self-contained. 

How This Book Will Help Students 

In writing this book, care was taken to use language and examples that gradually wean students from a simpleminded mechanical approach and 
move them toward mathematical maturity. We also recognize that many students who hesitate to ask for help from an instructor need a readable 
text, and we have tried to anticipate the questions that go unasked. 

The wide range of examples in the text are meant to augment the "favorite examples" that most instructors have for teaching the topics in 
discrete mathematics. 

To provide diagnostic help and encouragement, we have included solutions and/or hints to the odd-numbered exercises. These solutions include 
detailed answers whenever warranted and complete proofs, not just terse outlines of proofs. 

Our use of standard terminology and notation makes Applied Discrete Structures for Computer Science a valuable reference book for future 
courses. Although many advanced books have a short review of elementary topics, they cannot be complete. 

How This Book Will Help Instructors 

The text is divided into lecture-length sections, facilitating the organization of an instructor's presentation. 

Topics are presented in such a way that students' understanding can be monitored through thought-provoking exercises. The exercises require 
an understanding of the topics and how they are interrelated, not just a familiarity with the key words. 

An Instructor's Guide is available to any instructor who uses the text. It includes: 

(a) Chapter-by-chapter comments on subtopics that emphasize the pitfalls to avoid; 

(b) Suggested coverage times; 

(c) Detailed solutions to most even-numbered exercises; 

(d) Sample quizzes , exams , and final exams . 
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How This Book Will Help the Chairperson/Coordinator 

The text covers the standard topics that all instructors must be aware of; therefore it is safe to adopt Applied Discrete Structures for Computer 
Science before an instructor has been selected. 

The breadth of topics covered allows for flexibility that may be needed due to last-minute curriculum changes. 

Since discrete mathematics is such a new course, faculty are often forced to teach the course without being completely familiar with it. An 
Instructor's Guide is an important feature for the new instructor. 

What a Difference Five Years Makes! 

In the last five years, much has taken place in regards to discrete mathematics. A review of these events is in order to see how they have 
affected the Second Edition of Applied Discrete Structures for Computer Science. 

(1) Scores of discrete mathematics texts have been published. Most texts in discrete mathematics can be classified as one-semester or two- 
semester texts. The two-semester texts, such as Applied Discrete Structures for Computer Science, differ in that the logical prerequisites for a 
more thorough study of discrete mathematics are developed. 

(2) Discrete mathematics has become more than just a computer science support course. Mathematics majors are being required to take it, often 
before calculus. Rather than reducing the significance of calculus, this recognizes that the material a student sees in a discrete mathematics/struc- 
tures course strengthens his or her understanding of the theoretical aspects of calculus. This is particularly important for today's students, since 
many high school courses in geometry stress mechanics as opposed to proofs. The typical college freshman is skill-oriented and does not have a 
high level of mathematical maturity. Discrete mathematics is also more typical of the higher-level courses that a mathematics major is likely to 
take. 

(3) Authorities such as MAA, ACM, and A. Ralson have all refined their ideas of what a discrete mathematics course should be. Instead of the 
chaos that characterized the early '80s, we now have some agreement, namely that discrete mathematics should be a course that develops 
mathematical maturity. 

(4) Computer science enrollments have leveled off and in some cases have declined. Some attribute this to the lay-offs that have taken place in 
the computer industry; but the amount of higher mathematics that is needed to advance in many areas of computer science has also discouraged 
many. A year of discrete mathematics is an important first step in overcoming a deficiency in mathematics. 

(5) The Educational Testing Service introduced its Advanced Placement Exam in Computer Science. The suggested preparation for this exam 
includes many discrete mathematics topics, such as trees, graphs, and recursion. This continues the trend toward offering discrete mathematics 
earlier in the overall curriculum. 

Acknowledgments 

The authors wish to thank our colleagues and students for their comments and assistance in writing and revising this text. Among those who 
have left their mark on this edition are Susan Assmann, Shim Berkovitz, Tony Penta, Kevin Ryan, and Richard Winslow. 

We would also like to thank Jean Hutchings, Kathy Sullivan, and Michele Walsh for work that they did in typing this edition, and our depart- 
ment secretaries, Mrs. Lyn Misserville and Mrs. Danielle White, whose cooperation in numerous ways has been greatly appreciated. 

We are grateful for the response to the first edition from the faculty and students of over seventy-five colleges and universities . We know that 
our second edition will be a better learning and teaching tool as a result of their useful comments and suggestions. Our special thanks to the 
following reviewers: David Buchthal, University of Akron; Ronald L. Davis, Millersville University; John W Kennedy, Pace University; Betty 
Mayfield, Hood College; Nancy Olmsted, Worcester State College; and Pradip Shrimani, Southern Illinois University. Finally, it has been a 
pleasure to work with Nancy Osman, our acquisitions editor, David Morrow, our development editor, and the entire staff at SRA. 

A.W.D. 
K.M.L. 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-ShareAlike 3.0 United 



Applied Discrete Structures 



I Introduction 



What Is Discrete Mathematics/Structures? 

What is Discrete Mathematics? 

As a general description one could say that discrete mathematics is the mathematics that deals with "separated" or discrete sets of objects rather 
than with continuous sets such as the real line. For example, the graphs that we learn to draw in high school are of continuous functions. Even 
though we might have begun by plotting discrete points on the plane, we connected them with a smooth, continuous, unbroken curve to form a 
straight line, parabola, circle, etc. The underlying reason for this is that hand methods of calculation are too laborious to handle huge amounts of 
discrete data. The computer has changed all of this. 

Today, the area of mathematics that is broadly called "discrete" is that which professionals feel is essential for people who use the computer as 
a fundamental tool. It can best be described by looking at our Table of Contents. It involves topics like sets, logic, and matrices that students 
may be already familiar with to some degree. In this Introduction, we give several examples of the types of problems a student will be able to 
solve as a result of taking this course. The intent of this Introduction is to provide an overview of the text. Students should read the examples 
through once and then move on to Chapter One. After completing their study of discrete mathematics, they should read them over again. 

We hope discrete mathematics is as fascinating and enjoyable to the student as it has been to us. 

Example l.a. Analog-to-digital Conversion. A common problem encountered in engineering is that of analog-to-digital (a-d) conversion, where 
the reading on a dial, for example, must be converted to a numerical value. In order for this conversion to be done reliably and quickly, one 
must solve an interesting problem in graph theory. Before this problem is posed, we will make the connection between a-d conversion and the 
graph problem using a simple example. Suppose a dial in a video game can be turned in any direction, and that the positions will be converted 
to one of the numbers zero through seven in the following way. As depicted in Figure I.a.l , the angles from to 360 are divided into eight equal 
parts, and each part is assigned a number starting with and increasing clockwise. If the dial points in any of these sectors the conversion is to 
the number of that sector. If the dial is on the boundary, then we will be satisfied with the conversion to either of the numbers in the bordering 
sectors. This conversion can be thought of as giving an approximate angle of the dial, for if the dial is in sector k, then the angle that the dial 
makes with east is approximately 45 k° . 



/\ s 


ft /\ 


1 j\ 


Lj' ' 1 


V */* 


\ (1 J 











FIGURE l.a.1 

Now that the desired conversion has been described, we will describe a "solution" that has one major error in it, and then identify how this 
problem can be rectified. All digital computers represent numbers in binary form, as a sequence of Os and Is called bits, short for binary digits. 
The binary representations of numbers through 7 are: 

0= 000 = 0/4 + 0x2 + 0x1 

1 =001 =0x4 + 0x2 + lxl 

2 =010 = 0/4+ 1x2 + 0x1 

3 = 011= 0x4 + 1x2 + lxl 

4 = 100= 1/4 + 0x2 + 0x1 

5 = 101 = 1/4 + 0x2 + 1x1 

6 = 110= 1/4+ 1x2 + 0x1 

7 = 111 = 1x4 + 1x2 + lxl 

We will discuss the binary number system in Chapter 1 . The way that we could send those bits to a computer is by coating parts of the back of 
the dial with a metallic substance, as in Figure I.a.2. For each of the three concentric circles on the dial there is a small magnet. If a magnet lies 
under a part of the dial that has been coated with metal, then it will turn a switch ON, whereas the switch stays OFF when no metal is detected 
above a magnet. Notice how every ON/OFF combination of the three switches is possible given the way the back of the dial is coated. 

If the dial is placed so that the magnets are in the middle of a sector, we expect this method to work well. There is a problem on certain 
boundaries, however. If the dial is turned so that the magnets are between sectors three and four, for example, then it is unclear what the result 
will be. This is due to the fact that each magnet will have only a fraction of the required metal above it to turn its switch ON. Due to expected 
irregularities in the coating of the dial, we can be safe in saying that for each switch either ON or OFF could be the result, and so if the dial is 
between sectors three and four, any number could be indicated. This problem does not occur between every sector. For example, between 
sectors and 1, there is only one switch that cannot be predicted. No matter what the outcome is for the units switch in this case, the indicated 
sector must be either or 1 , which is consistent with the original objective that a positioning of the dial on a boundary of two sectors should 
produce the number of either sector. 
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FIGURE l.a.2 

Is there a way to coat the sectors on the back of the dial so that each of the eight patterns corresponding to the numbers to 7 appears once, and 
so that between any two adjacent sectors there is only one switch that will have a questionable setting? One way of trying to answer this 
question is by using an undirected graph called the 3-cube (Figure I.a.3). In general, an undirected graph consists of vertices (the circled O's and 
1 's in the 3-cube) and the edges , which are lines that connect certain pairs of vertices . Two vertices in the 3-cube are connected by an edge if the 
sequences of the three bits differ in exactly one position. If one could draw a path along the edges in the 3-cube that starts at any vertex, passes 
through every other vertex once, and returns to the start, then that sequence of bit patterns can be used to coat the back of the dial so that 
between every sector there is only one questionable switch. Such a path is not difficult to find; so we will leave it to you to find one, starting at 
000 and drawing the sequence in which the dial would be coated. 




Many A-D conversion problems require many more sectors and switches than this example, and the same kinds of problems can occur. The 
solution would be to find a path within a much larger yet similar graph. For example, there might be 1 ,024 sectors with 10 switches, resulting in 
a graph with 1 ,024 vertices. One of the objectives of this text will be to train you to understand the thought processes that are needed to attack 
such large problems. In Chapter 9 we will take a closer look at graph theory and discuss some of its applications. 

One question might come to mind at this point. If the coating of the dial is no longer as it is in Figure I.a.2, how would you interpret the patterns 
that are on the back of the dial as numbers from to 7? In Chapter 14 we will see that if a certain path is used, this "decoding" is quite easy. 

The 3-cube and its generalization, the n-cube, play a role in the design of a multiprocessor called a hypercube. A multiprocessor is a computer 
that consists of several independent processors that can operate simultaneously and are connected to one another by a network of connections. 
In a hypercube with M = 2" processors, the processors are numbered to M - 1. Two processors are connected if their binary representations 
differ in exactly one bit. The hypercube has proven to be the best possible network for certain problems requiring the use of a "supercomputer." 
Denning's article in the May-June 1987 issue of "American Scientist" provides an excellent survey of this topic. 

Example I.b. Logic is the cornerstone of all communication, whether we wish to communicate in mathematics or in any other language. It is 
the study of sentences, or propositions, that take on the values true or false, 1 or in the binary system. Its importance was recognized in the 
very early days of the development of logic (hardware) design, where Boolean algebra, the algebra of logic, was used to simplify electronic 
circuitry called gate diagrams. Consider the following gate diagram: 



ii. ii, ^j,h ■'■■(<, v nl 




FIGURE l.b.1 



Each symbol in this diagram is called a gate, a piece of hardware. In Chapter 13 we will discuss these circuits in detail. Assume that this 
circuitry can be placed on a chip which will have a cost dependent on the number of gates involved. A classic problem in logic design is to try 
to simplify this circuitry to one containing fewer gates. Indeed, the gate diagram can be reduced to 
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FIGURE I. b.2 
The result is a less costly chip. Since a company making computers uses millions of chips, we have saved a substantial amount of money. 

This use of logic is only the "tip of the iceberg." The importance of logic for computer scientists in particular, and for all people who use 
mathematics, cannot be overestimated. It is the means by which we can think and write clearly and precisely. Logic is used in writing algo- 
rithms, in testing the correctness of programs, and in other areas of computer science. 

Example I.e. Suppose two students miss a class on a certain day and borrow the class notes in order to obtain copies. If one of them copies the 
notes by hand and the other walks to a "copy shop," we might ask which method is more efficient. To keep things simple, we will only consider 
the time spent in copying, not the cost. We add a few more assumptions: copying the first page by hand takes one minute and forty seconds 
(100 seconds); for each page copied by hand, the next page will take five more seconds to copy, so that it takes 1:45 to copy the second page, 
1:50 to copy the third page, etc.; photocopiers take five seconds to copy one page; walking to the "copy shop" takes ten minutes, each way. 

One aspect of the problem that we have not specified is the number of pages to be copied. Suppose the number of pages is n, which could be 
any positive integer. As with many questions of efficiency, one method is not clearly better than the other for all cases. Since the only variable 
in this problem is the number of pages, we can simply compare the copying times for different values of n. We will denote the time it takes (in 
seconds) to copy n pages manually by t h (n), and the time to copy n pages automatically by t a (n). Ideally, we would like to have formulas to 
represent the values of t h (n) and t a (n). The process of finding these formulas is an important one that we will examine in Chapter 8. The 
formula for t a (n) is not very difficult to derive from the given information. To copy pages automatically, one must walk for twenty minutes 
(1 ,200 seconds), and then for each page wait five seconds. Therefore, t a (n) = 1200 + 5 n 

The formula for t h (n) isn't quite as simple. First, let p(n) be the number of seconds that it takes to copy page n. From the assumptions, 
p (1) = 100, and if n is greater than one, p (n) = p(n — 1) + 5. The last formula is called a recurrence relation. We will spend quite a bit of 
time discussing methods for deriving formulas from recurrence relations. In this case p (n) = 95 + 5 n. Now we can see that if n is greater than 
one, 

t h (n) = p(\) + p(2) + ••• + p(n) = t h (n - 1) + p(n) = t h {n - 1) + 5 n + 95 

This is yet another recurrence relation. The solution to this one is t h (n) = 97 .5 n + 2.5 n . 

Now that we have these formulas, we can analyze them to determine the values of n for which hand copying is most efficient, the values for 
which photocopying is most efficient, and also the values for which the two methods require the same amount of time. 

WHAT IS DISCRETE STRUCTURES? 

So far we have given you several examples of that area of mathematics called discrete mathematics. Where does the "structures" part of the title 
come from? We will look not only at the topics of discrete mathematics but at the structure of these topics. If two people were to explain a 
single concept, one in German and one in French, we as observers might at first think they were expressing two different ideas, rather than the 
same idea in two different languages. In mathematics we would like to be able to make the same distinction. Also, when we come upon a new 
mathematical structure, say the algebra of sets, we would like to be able to determine how workable it will be. How do we do this? We compare 
it to something we know, namely elementary algebra, the algebra of numbers. When we encounter a new algebra we ask ourselves how similar 
it is to elementary algebra. What are the similarities and the dissimilarities? When we know the answers to these questions we can use our vast 
knowledge of basic algebra to build upon rather than learning each individual concept from the beginning. 
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chapter 1 




SET THEORY I 



GOALS 

In this chapter we will cover some of the basic set language and notation that will be used throughout the text. Venn diagrams will be intro- 
duced in order to give the reader a clear picture of set operations. In addition, we will describe the binary representation of positive integers 
(Section 1.4) and introduce summation notation and its generalizations (Section 1.5). 
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Chapter 1 - Set Theory 



1 1.1 Set Notation and Relations 



The term set is intuitively understood by most people to mean a collection of objects that are called elements (of the set). This concept is the 
starting point on which we will build more complex ideas, much as in-geometry where the concepts of point and line are left undefined. 

Because a set is such a simple notion, you may be surprised to learn that it is one of the most difficult concepts for mathematicians to define to 
their own liking. For example, the description above is not a proper definition because it requires the definition of a collection. (How would you 
define "collection"?) Even deeper problems arise when you consider the possibility that a set could contain itself. Although these problems are 
of real concern to some mathematicians , they will not be of any concern to us . 

Our first concern will be how to describe a set; that is , how do we most conveniently describe a set and the elements that are in it? If we are 
going to discuss a set for any length of time, we usually give it a name in the form of a capital letter (or occasionally some other symbol). In 
discussing set A, if x is an element of A, then we will write x e A. On the other hand, if x is not an element of A, we write x £ A. The most 
convenient way of describing the elements of a set will vary depending on the specific set. 

Method 1: Enumeration. When the elements of a set are enumerated (or listed) it is traditional to enclose them in braces. For example, the set 
of binary digits is {0, 1) and the set of decimal digits is {0, 1, 2, 3, 4, 5, 6, 7, 8, 9). The choice of a name for these sets would be arbitrary; 
but it would be "logical" to call them B and D , respectively. The choice of a set name is much like the choice of an identifier name in program- 
ming. Some large sets can be enumerated without actually listing all the elements. For example, the letters of the alphabet and the integers from 
1 to 100 could be described as 

A = (a, b, c, ..., x, y, z),andG = (1, 2, ... , 99, 100). 

The three consecutive "dots" are called an ellipsis. We use them when it is clear what elements are included but not listed. An ellipsis is used in 
two other situations. To enumerate the positive integers, we would write {1, 2, 3, ... ), indicating that the list goes on infinitely. If we want to 
list a more general set such as the integers between 1 and n, where n is some undetermined positive integer, we might write { 1 , . . . , n). 

Method 2: Standard Symbols. Frequently used sets are usually given symbols that are reserved for them alone. For example, since we will be 
referring to the positive integers throughout this book, we will use the symbol P instead of writing {1, 2, 3, ...). A few of the other sets of 
numbers that we will use frequently are: 

N = the natural numbers = {0, 1, 2, 3, ...). 

Z = the integers = {..., -3, -2, -1, 0, 1, 2, 3, ...) 

Q = the rational numbers . 

R = the real numbers . 

C = the complex numbers . 

Method 3: Set-Builder Notation. Another way of describing sets is to use set-builder notation. For example, we could define the rational 
numbers as 

Q= {a/b : a, fceZ, btO] 
Note that in the set-builder description for the rational numbers: 

(1) a/b indicates that a typical element of the set is a "fraction." 

(2) The colon is read "such that" or "where," and is used interchangeably with a vertical line, |. 

(3) a, b e Z is an abbreviated way of saying a and b are integers. 

(4) All commas in mathematics are read as "and." 

The important fact to keep in mind in set notation, or in any mathematical notation, is that it is meant to be a help, not a hindrance. We hope 
that notation will assist us in a more complete understanding of the collection of objects under consideration and will enable us to describe it in 
a concise manner. However, brevity of notation is not the aim of sets. If you prefer to write a e Z and teZ instead of a, be Z, you should 
do so. Also, there are frequently many different, and equally good, ways of describing sets. For example, |xeR| x 2 - 5 x + 6 = 0) and 
{x | x<=R : x 2 - 5 x + 6 = 0} both describe the solution set {2, 3). 

A proper definition of the real numbers is beyond the scope of this text. It is sufficient to think of the real numbers as the set of points on a 
number line. The complex numbers can be defined using set-builder notation as C = {a + bi:a, b e R), where i 2 = -1. 
In the following definition we will leave the word "finite" undefined. 

Definition: Finite Set. A set is a finite set if it has a finite number of elements. Any set that is not finite is an infinite set. 

Definition: Cardinality. Let A be a finite set. The number of different elements in A is called its cardinality and is denoted by \A\. 

As we will see later, there are different infinite cardinalities. We can't make this distinction now, so we will restrict cardinality to finite sets until 
later. 
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SUBSETS 

Definition: Subset. Let A and B be sets. We say that A is a subset of B (notation AcB) if and only if every element of A is an element of 
B. 

Example 1.1.1. 

(a) If A = {3,5, 8} and B = {5, 8, 3, 2, 6}, then AcB. 

(b) NcZcQcRcC 

(c)IfA = {3, 5, 8)andfi = {5, 3, 8), then A c B and B c A. 

Definition: Equality. Let A and B be sets. We say that A is equal to B (notation A = B) if and only if every element of A is an element of 
B and conversely every element ofB is an element of A; that is, AcB and B cA. 

Example 1.1.2. 

(a) In Example 1 .1 ,1c, A = B. Note that the ordering of the elements is unimportant. 

(b) The number of times that an element appears in an enumeration doesn't affect a set. For example, if A = {1, 5, 3, 5)andB = {1, 5, 3), 
then A = B. Warning to readers of other texts: Some books introduce the concept of a multiset, in which the number of occurrences of an 
element matters. 

A few comments are in order about the expression "if and only if" as used in our definitions. This expression means "is equivalent to saying," or 
more exactly, that the word (or concept) being defined can at any time be replaced by the defining expression. Conversely, the expression that 
defines the word (or concept) can be replaced by the word. 

Occasionally there is need to discuss the set that contains no elements, namely the empty set, which is denoted by the Norwegian letter 0. This 
set is also called the null set. 

It is clear, we hope, from the definition of a subset, that given any set A we have A c A and c A. Both and A are called improper subsets of 
A.IfBcA,S + 0, and B =/= A , then B is called a proper subset of A . 

EXERCISES FOR SECTION 1.1 
A Exercises 

1 . List four elements of each of the following sets: 

(a) {k e P | k - 1 is a multiple of 7} 

(b) {x | x is a fruit and x ' s skin is normally eaten) 

(c) |xeQ|ieZ| 

(d) (2«|nsZ, «<0| 

(e) [s\s = 1 + 2 + •■■ + n, neP} 

2. List all elements of the following sets: 

(a) {i| ne{3,4, 5,6}} 

(b) {a e the alphabet | a precedes F) 

(c) {-* |ieP| 

(d)(« 2 | n = -2, -1, 0, 1, 2) 
(e){n e P | n is a factor of 24) 

3. Describe the following sets using set-builder notation. 

(a) {5, 7, 9, ... , 77, 79) 

(b) the rational numbers that are strictly between - 1 and 1 

(c) the even integers 

(d) (-18, -9,0,9, 18,27, . . .) 

4. Use set-builder notation to describe the following sets: 

(a) {1,2,3,4,5,6,7} 

(b) {1,10,100,1,000,10,000} 
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(c) {1,1/2,1/3,1/4,1/5,...} 

(d) {0} 

5. Let A= {0,2, 3},B = {2, 3}, and C = {1,5,9}. Determine which of the following statements are true. Give reasons for your answers . 

a. 3 e A 

b. (3) e A 

c. {3) QA 

d. BqA 

e. AqB 

f. 0cC 

g. 0eA 
h. Ac A 

C Exercise 

6. One reason that we left the definition of a set vague is Russell's Paradox. Many mathematics and logic books contain an account of this 
paradox. Two references are Stoll and Quine. Find one such reference and read it. 
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1 1.2 Basic Set Operations 



Definition: Intersection. Let A and B be sets. The intersection of A and B (denoted by A f] B) is the set of all elements that are in both A 
and B. That is, A f] B = {x : x e Aandx e B). 

Example 1.2.1. 

(a) Let A = (1, 3, 8)andS = (-9, 22, 3}. Then A f] B = (3). 

(b) Solving a system of simultaneous equations such as x + y = 7 and x — y = 3 can be viewed as an intersection. Let 
A = \(x, y) : x + y = 7, x, y e R) and B = {(x, y) : x - y - 3, x, y e R). These two sets are lines in the plane and their intersection, 
A p| B = {(5, 2)}, is the solution to the system. 

(c) z n Q = Z. 

(d)IfA = {3, 5, 9}andS = {-5, 8), then A f] B = 0. 

Definition: Disjoint Sets. Two sete are disjoint if they have no elements in common (as in Example 1 .2.1 d). That is, A and B are disjoint 
ifAf]B = 0. 

Definition: Union. Let A and B be sets. The union of A and B (denoted by A IJ B) is the set of all elements that are in A or in B or in 
both A and B. That is, A \J B = \x : x e A or x e B\. 

It is important to note in the set-builder notation for A[J B, the word "or" is used in the inclusive sense; it includes the case where x is in both A 
andS. 

Example 1.2.2. 

(a) If A = {2, 5, 8}andS = {7, 5, 22), then A U B = {2, 5, 8, 7, 22). 

(b) ZUQ=Q- 

(c) A (J = A for any set A. 

Frequently, when doing mathematics, we need to establish a universe or set of elements under discussion. For example, the set 
A = [x : 81 x 4 - 16 = 0} contains different elements depending on what kinds of numbers we allow ourselves to use in solving the equation 
81 x 4 — 16 = 0. This set of numbers would be our universe. For example, if the universe is the integers, then A is 0. If our universe is the 
rational numbers, then A is {2/3, -2/3} and if the universe is the complex numbers, then A is {2/3, -2/3, 2i/3, — 2J/3). 

Definition: Universe. The universe, or universal set, is the set of all elements under discussion for possible membership in a set. 
We normally reserve the letter U for a universe in general discussions. 

VENN DIAGRAMS 

When working with sets, as in other branches of mathematics, it is often quite useful to be able to draw a picture or diagram of the situation 
under consideration. A diagram of a set is called a Venn diagram. The universal set U is represented by the interior of a rectangle and the sets 
by disks inside the rectangle. 

Example 1.2.3. 

(a) A H B is illustrated in Figure 1.2.1 by shading the appropriate region. 




FIGURE 1.2.1 Venn diagram for intersection 



(b) A U B is illustrated in Figure 1.2.2. 
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FIGURE 1 .2.2 Venn diagram for union 

In a Venn diagram, the region representing A f] B does not appear empty; however, in some instances it will represent the empty set. The same 
is true for any other region in a Venn diagram. 

Definition: Complement. Let A and B be sets. The complement of A relative to B (notation B - A) is the set of elements that are in B 
and not in A. That is, B - A = [x : x e B and x ( A). If U is the universal set, then U - A is denoted by A c and is called simply the comple- 
ment of A. A c ={x e U : x £ A\. 

Example 1.2.4. 

(a) Let5 = (1,2, 3, ... , 10) and A = (2,4,6,8, 10). Then U - A = (1, 3, 5, 7, 9) and A - U = 

(b) If U = R, then the complement of the rational numbers is the irrational numbers . 

(c) U c = and C = U. 

(d) The Venn diagram of A - B is represented in Figure 1 .2.3. 




FIGURE 1 .2.3 Venn diagram for A - B 
(e) The Venn diagram of A c is represented in Figure 1 .2.4. 
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FIGURE 1 .2.4 Venn diagram for A c 
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(f) If B c A, then the Venn diagram of A - 6 is in Figure 1.2.5. 
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FIGURE 1.2.5 Venn diagram for A- 6 where Sis contained in A 
(g) In the universe of integers , the set of even integers ,{... , - 4 , -2 , 0, 2, 4, ...}, has the set of odd integers as its complement . 

Definition: Symmetric Difference. Let A and B be sets. The symmetric difference of A and B (denoted by A @B) is the set of all elements 
that are in AandB but not in both. That is, A &) B = (A (J B) - (A f] B). 

Example 1.2.5. 

(a) Let A = {1. 3. 8}andS = {2, 4. 8}.ThenA0S = {1, 2, 3, 4). 

(b) A © = A and A © A = for any set A. 

(c) IR © Q = the irrational numbers. 

(d) The Venn diagram of A © B is represented in Figure 1 .2.6. 




# 



Mathematica Note 



One of the basic objects in Mathematica is a list. A list can be treated as a set. Here are a few examples. First we define A and B using the 
divisor function. This gives you all positive integers that divide evenly into a given positive integer, such as 525. 

A= Divisors [525] 

(1. 3. 5, 7, 15. 21. 25, 35, 75, 105, 175, 525) 
B = Divisors [300] 

(1, 2, 3, 4, 5, 6, 10, 12, 15, 20, 25, 30, 50, 60, 75, 100, 150, 300} 
Here are the union and intersection of the two sets: 
Union [A, B] 
(1, 2, 3, 4, 5, 6, 7, 10, 12, 15, 20, 21, 25, 30, 35, 50, 60, 75, 100, 105, 150, 175, 300, 525} 
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Intersection [A, B] 

{1, 3,5, 15,25,75) 

Notice the outputs from both expressions automatically sort the elements in ascending order. The intersection is quite significant. It is the set 
of common divisors of both 525 and 300. The largest of these divisiors, 75, is the greatest common divisor of 525 and 300. 

Here is the complement of B with respect to A. 

Complement [A, B] 

(7,21,35, 105, 175,525) 
There is no built-in Symmetric Difference function, but it can be defined: 

SymmetricDif ference [X_, Y_] := Complement [Union [X, Y] , Intersection [X, Y] ] 
Now we can use the function: 

SymmetricDif ference [A, B] 

{2, 4, 6, 7, 10, 12, 20, 21, 30, 35, 50, 60, 100, 105, 150, 175, 300, 525) 
The empty set is { ). 

SymmetricDif ference [A, A] 

{} 

What corresponds to set-builder notation in Mathematicci is the function Select. For example, the set of all integers from 1 to 1000 whose 
digits add up to 10 could be described in set builder notation as {n e Z | 1 < n < 1000, sum of digits of n = 10). We can produce that set 
using Select. 

Select [Range [1, 1000], Function[n, Apply[Plus, IntegerDigits [n] ] == 10]] 

{19, 28, 37, 46, 55, 64, 73, 82, 91, 109, 118, 127, 136, 145, 154, 163, 172, 181, 190, 208, 217, 226, 
235, 244, 253, 262, 271, 280, 307, 316, 325, 334, 343, 352, 361, 370, 406, 415, 424, 433, 442, 451, 460, 
505, 514, 523, 532, 541, 550, 604, 613, 622, 631, 640, 703, 712, 721, 730, 802, 811, 820, 901, 910) 

An alternate form of the second argument to this expression, using a "pure function" is 

Select [Range [1, 1000], Apply[Plus, IntegerDigits [B] ] == 10 &] 

(19, 28, 37, 46, 55, 64, 73, 82, 91, 109, 118, 127, 136, 145, 154, 163, 172, 181, 190, 208, 217, 226, 
235, 244, 253, 262, 271, 280, 307, 316, 325, 334, 343, 352, 361, 370, 406, 415, 424, 433, 442, 451, 460, 
505, 514, 523, 532, 541, 550, 604, 613, 622, 631, 640, 703, 712, 721, 730, 802, 811, 820, 901, 910) 

EXERCISES FOR SECTION 1.2 
A Exercises 

1. LetA = (0, 2, 3),B = (2, 3),C = (1, 5, 9), and let the universal set be U = (0, 1, 2, ..., 9). Determine: 

(a) A n B 

(b) A U B 

(c) B U A 

(d) A U C 

(e) A - B 
(f)B-A 

(g) A c 
(h) C c 

« a n C 

(j) Aefi 

2. Let A, B, and C be as in Exercise 1, let D = (3, 2), and let E = (2, 3, 2). Determine which of the following are true. Give reasons for 
your decisions. 
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(a) A = B 

(b) B = C 

(c) B = D 

(d) E = D 

(e) A n B = B n A 

(f)AUS = SUA 
(g)A-B = B-A 

(ii)AeB = seA 

3 . Let U= {1,2,3, ... ,9}. Give examples of sets A,B, and C for which: 

(a) a n (b n Q = (a n b) n c 
w a n (b u Q = (a n m u (a n o 

(c) (A U B) r =A c ns r 

(d) A U A' = (/ 

(e) A c A IJ B 

(f) AflS cA 

4. Let (/ = {1, 2, 3, . .., 9). Give examples to illustrate the following facts: 

(a) If A c B and B c C, then A c C. 

(b) A - B =/= B - A 

(c) If U = A U B and A fl B = , it always follows that A = U - B. 

(d) A x (B fl Q = (A x B) fl (A xC) 

B Exercises 

5. What can you say about A if U = {1, 2, 3, 4, 5),B = {2, 3), and (separately) 

(a) A UB = U, 2, 3,4) 

(b) A fl B = {2} 

(c) AeB = {3, 4, 5} 

6. Suppose that (/ is an infinite universal set, and A and B are infinite subsets of V ' . Answer the following questions with a brief explanation. 

(a) Must A c be finite? 

(b) Must A IJ B infinite? 

(c) Must A fl B be infinite? 

7. Given that U = all students at a university, D = day students, M = mathematics majors, and G = graduate students. Draw Venn diagrams 
illustrating this situation and shade in the following sets: 

(a) evening students 

(b) undergraduate mathematics majors 

(c) non-math graduate students 

(d) non-math undergraduate students 

8. Let the sets £>,M, G, and (7 be as in exercise 7. Let | U \ = 16, 000, | D | = 9,000, \M \ = 300, and \G \ = 1, 000. Also assume that 
the number of day students who are mathematics majors is 250, fifty of whom are graduate students, that there are 95 graduate mathematics 
majors, and that the total number of day graduate students is 700. Determine the number of students who are: 

(a) evening students 



(b) nonmathematics majors 

(c) undergraduates (day or evening) 
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(d) day graduate nonmathematics majors 

(e) evening graduate students 

(f) evening graduate mathematics majors 

(g) evening undergraduate nonmathematics majors 

C Exercise 

9. (a) Evaluate the following expressions in Mathematica to learn more about Select and to learn about PrimeQ. 

? Select 
? PrimeQ 
(b) Use Mathematica to list all primes between 2000 and 2099, inclusive. 

10. (a) Evaluate the following expression in Mathematica to learn about SquareFree 

? SquareFreeQ 
(b) Use Mathematica to list all square-free integers between 2000 and 2099, inclusive. 
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1 1 .3 Cartesian Products and Power Sets 



Definition: Cartesian Product. Let A and B be sets. The Cartesian product of A and B, denoted by AxB, is defined as follows: 
AxB = {(a, b) | a e A andb e B), that is, AxB is the set of all possible ordered pairs whose first component comes from A and whose 
second component comes from B. 

Example 1.3.1. Notation in mathematics is often developed for good reason. In this case, a few examples will make clear why the symbol 
x is used for Cartesian products . 

(a) UtA = {1,2,3} and B = {4,5}.Then AxB = {(l,4),(l,5),(2,4),(2,5),(3,4),(3,5)}.Note that |AxB| =6 = \A\ x \B\. 

(b)AxA = ((1, 1), (1, 2), (1, 3), (2, 1), (2, 2), (2, 3), (3, 1), (3, 2), (3, 3)). Note that |AxA| = 9 = |A| 2 . 
These two examples illustrate the general rule: If A and B are finite sets, then | A xB | =6 = \A\ x \B\. 

We can define the Cartesian product of three (or more) sets similarly. For example, A x B x C = {(a, b, c) : a <= A, b e B, c e C). It is 
common to use exponents if the sets in a Cartesian product are the same: 

A 2 =AxA , 
A 3 = A xA xA 



and in general, 



A" = {(ai , a2 , ... , a„) : each a e A). 



Power Sets 

Definition: Power Set. If A is any set, the power set of A is the set of all subsets of A, denoted P(A). 
The two extreme cases, theempty set and all of A, are both included in P(A). 
Example 1.3.2. 

(a) P(0) = I0) 

(b) ?>(U}) = (0, (D) 

(c) P({1,2)) = {0, (1), (2), {1, 2)). 

We will leave it to you to guess at a general formula for the number of elements in the power set of a finite set. In Chapter 2, we will discuss 
counting rules that will help us derive this formula. 

EXERCISES FOR SECTION 1.3 
A Exercises 

1. LetA = {0, 2, 3),£ = {2, 3),C = (1, 4), and let the universal set be U = (0, 1, 2, 3, 4). List the elements of 

(a) A xB 

(b) BxA 

(c) A xBxC 

(d) U x0 
(e) A x A c 

ffiB 2 

(g) B 3 

(h) Bxf(B) 

2 . Suppose that you are about to flip a coin and then roll a die . Let A = {HEAD , TAIL} and B = {1,2,3,4,5, 6}. 

(a) What is \AxB\l 

(b) How could you interpret the set A x B 1 

3. List all two-element sets in V({a, b, c\) 

4. List all three-element sets in V ({a, b, c, d)). 
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5 . How many singleton (one-element) sets are there in P(A) if | A | = n ? 

6. A person has four coins in his pocket: a penny, a nickel, a dime, and a quarter. How many different sums of money can he take out if he 
removes 3 coins at a time? 

7. LetA = { + , -}andfl = {00, 01, 10, 11). 

(a) List the elements ofA xB 

(b) How many elements do A 4 and (A x B) 3 have? 

B Exercises 

8. LetA = {•, □, ®)andB = {□, e, •). 

(a) List the elements of A x B and B x A. The parentheses and comma in an ordered pair are not necessary in cases such as this where the 
elements of each set are individual symbol. 

(b) Identify the intersection of A x B and B x A. for the case above, and then guess at a general rule for the intersection of A x B and B x A. 
where A and B are any two sets . 

9. Let A and B be nonempty sets. When are A x B and B x A. equal? 
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I 1.4 Binary Representation of Positive Integers 



Recall that the set of positive integers, P, is {1, 2, 3, . . . }. Positive integers are naturally used to count things. There are many ways to count 
and many ways to record, or represent, the results of counting. For example, if we wanted to count five hundred twenty-three apples, we might 
group the apples by tens. There would be fifty-two groups of ten with three single apples left over. The fifty-two groups of ten could be put into 
five groups of ten tens (hundreds), with two tens left over. The five hundreds, two tens, and three units is recorded as 523. This system of 
counting is called the base ten positional system, or decimal system. It is quite natural for us to do grouping by tens, hundreds, thousands, . . . , 
since it is the method that all of us use in everyday life. 

The term positional refers to the fact that each digit in the decimal representation of a number has a significance based on its position. Of course 
this means that rearranging digits will change the number being described. You may have learned of numeraton systems in which the position 
of symbols does not have any significance (e.g., the ancient Egyptian system). Most of these systems are merely curiosities to us now. 

The binary number system differs from the decimal number system in that units are grouped by twos, fours, eights, etc. That is, the group sizes 
are powers of two instead of powers of ten. For example, twenty-three can be grouped into eleven groups of two with one left over. The eleven 
twos can be grouped into five groups of four with one group of two left over. Continuing along the same lines, we find that twenty-three can be 
described as one sixteen, zero eights, one four, one two, and one one, which is abbreviated 10 1 1 l two , or simply 101 1 1 if the context is clear. 

The process that we used to determine the binary representation of 23 can be described in general terms to determine the binary representation 
of any positive integer n. A general description of a process such as this one is called an algorithm. Since this is the first algorithm in the book, 
we will first write it out using less formal language than usual, and then introduce some "algorithmic notation." 

Step One: Start with an empty list of bits. 

Step Two: Assign the variable k the value n. 

Step Three: While k's value is positive, continue doing the following three steps, and when k becomes zero, stop. First, divide k by 2, obtaining 
a quotient q (often denoted k div 2) and a remainder r (denoted k mod 2). Second, attach r to the left-hand side of the list of bits. Third, assign 
the variable k the value of q. 

Example 1.4.1. Determine the binary representation of 41. 

k = 2 X q + r — > add r to the list 

41 = 2x20+ 1 List= 1 

20 = 2 x 10 + List = 01 

10 = 2x5 + List = 001 

5 = 2x2+1 List =1001 

2 =2x 1 + List = 01001 

1 = 2 xO + 1 List= 101001 

Therefore, 41 = 101001 two 

The notation that we will use to describe this algorithm and all others is called pseudocode, an informal variation of the instructions that are 
commonly used in many computer languages. Read the following description carefully, comparing it with the informal description above. 
Appendix B, which contains a general discussion of the components of the algorithms in this book, should clear up any lingering questions. 
Anything after // are comments . 

Algorithm 1.4.1: Algorithm for Determining the Binary Representation of a Positive Integer. 

Input: a positive integer n. 

Output: the binary representation of n in the form of a list of bits, with units bit last, twos bit next to last, etc. 

(1) k:=n //initialize k 

(2) L := an empty list //initialize L 

(3) While k > do 

(3.1) q := k div 2 //divide k by 2 
r:= k mod 2 

(3.2) L: = prepend r to L //Add r to the front of L 
(3.3)k:=q //reassign k 
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Now that you've read this section, you should get this joke from the xkcd.com . 



ON ASCALE0F1T& 10, 
HOW LIKELY IS \T TW*T 

this question! is 

USING BNART? 
WHAT'S AH? ) 




EXERCISES FOR SECTION 1.4 
A Exercises 

1. Find the binary representation of each of the following positive integers: (a) 31 (b) 32 (c) 10 (d) 100 

2. Find the binary representation of each of the following positive integers: (a) 64 (b) 67 (c) 28 (d) 256 

3. What positive integers have the following binary representations? (a) 10010 (b) 1001 1 (c) 101010 (d) 1001 1 1 10000 

4. What positive integers have the following binary representations? (a) 100001 (b) 1001001 (c) 1000000000 (d) 1001 1 10000 

5. The example, decimal 1000 has 10 bits since it is 1111 101000 two You might save some time by thinking of how 10 can be arrived at 
without finding the exact binary representation. How many bits do the binary representations of the following decimal numbers have? 

(a) 201 1 (b) 4000 (c) 4500 (d) 2 50 

B Exercises 

6. Let m be a positive integer with rc-bit binary representation: a„_i a„_2 ••• «i Qo with a„_i = 1 What are the smallest and largest values that m 
could have? 

7. If a positive integer is a multiple of 100, we can identify this fact from its decimal representation, since it will end with two zeros. What can 
you say about a positive integer if its binary representation ends with two zeros? 

8 . Can a multiple of ten be easily identified from its binary representation? 
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1 1.5 Summation Notation and Generalizations 



Most operations such as addition of numbers are introduced as binary operations. That is, we are taught that two numbers may be added 
together to give us a single number. Before long, we run into situations where more than two numbers are to be added. For example, if four 
numbers, a\, a 2 , 03, and 04 are to be added, their sum may be written down in several ways, such as ((ai + a 2 ) + 03) + 04 or 
{d\ + a 2 ) + (03 + 04). In the first expression, the first two numbers are added, the result is added to the third number, and that result is added to 
the fourth number. In the second expression the first two numbers and the last two numbers are added and the results of these additions are 
added. Of course, we know that the final results will be the same. This is due to the fact that addition of numbers is an associative operation. For 
such operations, there is no need to describe how more than two objects will be operated on. 

4 
A sum of numbers such as a\ + a 2 + 03 + 04 is called a series and is often written £ a k in what is called summation notation. 

k=\ 

We first recall some basic facts about series that you probably have seen before. A more formal treatment of sequences and series is covered in 
Chapter 8 . The purpose here is to give the reader a working knowledge of summation notation and to carry this notation through to intersection 
and union of sets and other mathematical operations . 

n 

A finite series is an expression such as a\ + a 2 + 03 + ... + a n = £ a k 

k=\ 

In 2 a„ / is referred to as the index, or the index of summation; the value below the summation symbol is the initial index and the value above 
the summation symbol is the terminal index. The a, are called the terms of the series. The initial index of a summation may be different from 1 . 

Example 1.5.1. 

4 

(a) J! a i - a l + a 2 + a 3 + °4 

5 

(b) E b k = b + b l +b 2 +b 3 + b A + b 5 

k=0 

2 
(C) £ Ci = C-2 + C-\ + Co + C\ + c 2 
i=-2 

Example 1.5.2. If the terms in series are more specific, the sum can often be simplified. For example, 

4 

(a) £ i 2 = l 2 + 2 2 + 3 2 + 4 2 = 30 

5 

(b) £ (2 i - 1) = (2 x 1 - 1) + (2 x 2 - 1) + (2 x 3 - 1) + (2 x 4 - 1) + (2 x 5 - 1) 
;=i 

=1+3+5+7+9 

= 25 

Summation notation can be generalized to many mathematical operations, for example, 

A 1 r)A 2 r)A 3 r)A 4 = n^ 

Definition: Generalized Set Operations. Let A h A 2 , ..., A n be sets, then: 

(1) A l C]A 2 C]---C]A n = f]A, 

i=l 

(1) A,\JA 2 [J ■■■[JA„ = \JA l 

i=l 

n 

(1) A, xA 2 x ■■■xAn = x Ai 

i=l 

(i) A,eA 2 e---eA„ = ®A t 

i=l 

Example 1.5.3. If A, = {0, 2, 3},A 2 = {1, 2, 3, 6),andA 3 = {-1, 0, 3, 9), then 
f]A i = A i f]A 2 f]A 3 ={3} 



and 



UA,=A,UA 2 UA 3 = {-1,0, 1,2,3,6,9) 
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With this notation it is quite easy to write lengthy expressions in a fairly compact form. For example, the statement 

a n (Bi u b 2 u ■ ■ ■ u s„) = (a n so u (a n b 2 ) u • • • u (a n b„) 

becomes 

AnfuB,]= u(AnB„) 

V;=i I ,=i 

EXERCISES FOR SECTION 1.5 
A Exercises 

1 . Calculate the following series: 

(a) | (2 + 3 i) 

(b) Z r 

(c) £ 2^ for re= 1, 2, 3, 4 

(d) £(2£- 1) for n = 1, 2, 3, 4 
*t=i 



2. Calculate the following series: 



(a) 


3 

Z i" for n = 1 , 

i=l 


2, 


3, 


4 






(b) 


Z20 

i=i 












(c) 


3 

Z (n' + 1) for 

y=0 


n : 


= 1 


, 2, 


3, 


4 


(d) 


n 

Z k for « = 


: 1, 


. 2, 


3, 


4 





3. (a) Express the formula Z = — without using summation notation. 

(b) Verify this formula for n = 3 . 

(c) Repeat parts (a) and (b) for Z i' 3 = — 

i=i 4 



4. Verify the following properties for n = 3. 



n n 



(a) 5>i+ &,-) =2a; + Zbi 

i=l i=l i=l 



n \ n 

£«1 = Z ca i 

!=1 / 1=1 



(b) c 

Vi=i y ;=i 

5. Rewrite the following without summation sign for n = 3. It is not necessary that you understand or expand the notation ] at this point. 

(x + y)"= if ")x n - k y k 

4 

6. (a) Draw the Venn diagram for f]Aj. 

i=i 

(b) Express in "expanded format": 

AufrWI- ncAUSn). 

Vi=l / i=l 
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7. For any positive integer k, let A k = {x e Q : k - 1 < x < k\ and B|, = |j;eQ: -i<i<i|. What are the following sets? 

(a) IM, 

(b) UB; 

i=l 

(c) f]At 

i=i 

(d) flB, 
1=1 

8. For any positive integer k, let A = {xeQ: < x < l/k\andB k = {xeQ: < x < k\. What are the following sets? 

(a) \JA, 
i=i 

(b) US; 

i=i 

(c) f]A, 
i=i 

(d) nfi, 
i=i 

9. The symbol n is used for the product of numbers in the same way that £ is used for sums. For example, 

5 

| | Xj = X\ X2 X3 X4 Xg 

1 = 1 

Evaluate the following: 

(a) W 2 

i=i 

(b) n(2/+l) 

i=i 

10. Evaluate 

(a) fl2* 

t=o 

100 . 

»' n=i 
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SUPPLEMENTARY EXERCISES FOR CHAPTER 1 
Section 1.1 

1 . Enumerate the elements in the following sets: 

(a) {ieR \x 2 - 3x + 2 = 0} 

(b) {x e R | x- + 1 =0} 

(c) {ieC|r + 1=0} 

Section 1.2 

2. Lett/ = {0, 1,2, 3 9), A = {0,2, 3},B = {2, 3},C = (1,5,9). 

Determine: 

(a) (A U B) n C 

(b) A c n B" 

(c) (Ausr 

3. Let U={0, l,2,3,4,5,6,7,8,9},A = {xe(7:xisamultipleof3},andB = {xe£/:x 2 - 5 > 0}. 
Determine: 

(a) A U B 

(b)AflB 

(c)B c 

4. Let A,B, and C be subsets of some universal set U. Draw Venn diagrams to illustrate each of the following expressions: 

(a) (A n BY (e) A c f] B< fl C c 

(b) A c U B c (f) A'' n B'' U C c 

(c) A n (B U O (g) A r n (B U Cf 

(d) (a n b) u (a n c) (h) (a c n b c ) u (A r n c c ) 

5. Let A = {1,2,3,4 ,5,6}, B = {2,5,7 ,9, 12}, and C= {4 ,7 ,9, 15,23}. 

(a) Verify the formula: \A \J B\ = |A| + |B| - \Af)S\ for this 

example. 

(b) Use the formula in part (a) to compute: 

|AUC|,|BUC|,|AnB| 
(c) Use the formula of part (a) to derive a formula for \A U B (J C|. 
Verify your formula for this example. 

6. [/={l,2,3,4,5,6,7,8},A = {ae U | a 2 is even}, and B = {a e f/ I a + 1 is a multiple of three} 

(a) A = { } (List) 

(b) B = { } (List) 

(c) A c = 

(d) A n B = 

Section 1.3 

7. Let U ={0,1, 2, 3,..., 9}, A = {2,4,6}, B = {4}, C = {1,5}. 
Determine: 

(a) B 2 (d)BxA 

(b) A x 5 (e) (A x B) x C 
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(c) B 3 

8. Let A, B, and C be sets where | A | = 2, | B | = 3, and |C| =4. 

(a) Give an example showing that Ax B x C * AxCxB, and explain why the two sets are not equal. 
(b)Is \AxBxC\= \A x C xB\ 

9 . Let A = { 1 , 2} . Determine |P(A)xf (A) | and then list all elements of 9(A) x 9(A) . 

10. Let A = {a, b, c, d}. Determine which of the following are true: 

(a) {b} e 9(A) (d) b e 9(A) 

(b) A e 9(A) (e)0e9(A) 

(c) !P(A) e !P(A) (f) !P(A) c 9(A) 

Section 1.4 

1 1 . The addition rules for binary numbers are: 

+ = 0, 

1+0 = 0+1 = 1, 

1 + 1=0 and carry 1 to the next column 
and 1 + 1 + 1 = 1 and carry 1 to the next column. 
For example: 11111 < — carries 

3i te „= inn 

ll ten = 1011 

101010 = 42 tc „ 
Express and compute the following sums with binary numbers; verify your result by converting back to base ten. 
(a) 31 + 32 (b) 64 +- 1 1 (c) 13 + 15 + 9 

12. Compute the following sums; express the result in decimal form. 

(a) 10010 + 10011 (c) 1101.11 + 100.1 

(b) 101001 + 1101 

13. Multiplication of binary numbers is a process similar to decimal multiplication. The rules for binary multiplication are: 0-0 = 0, 1-0 = 0- 
1 =0, 1-1 = 1. Compute the following products and verify by converting numbers to base ten: 

(a) 1001 -11 (b) 1001 -1101 

14. How are multiplication and division by two accomplished in the binary number system? 

Section 1.5 

15. Let A„ = {k e N : k < 3 n} for each neN. List the elements, 
(a) A„for« = 0, 1,2,3 



(b) 


f]A„ 




n=l 


(c) 


r\A„ 

n=0 


(d) 


UA n 




n=i 


(e) 


U4 

n=0 



16. Expand each of the following and convince yourself that they are true. Assume that m, n e N with m < n, and c, x„ e (R. 
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n n-\ 



(a) 2_fi = 2_jXj + x„ 

i=l i=l 

n n n 

(b) ^(x; + yd = Yfi + ^/; 
i=i i=i 

(c) 2_fXj = c2_/i 



1=1 1=1 



1=1 



n m 



(d) ^\, = Yj°i + X X/ 

i=\ i=\ i=m+l 

17. Which of the above are true if + is replaced with • and H is replaced with fl? Explain. 
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chapter 2 




COMBINATORICS 



GOALS 

Throughout this book we will be counting things. In this chapter we will outline some of the tools that will help us count. 

Counting occurs not only in highly sophisticated applications of mathematics to engineering and computer science but also in many basic 
applications. Like many other powerful and useful tools in mathematics, the concepts are simple; we only have to recognize when and how they 
can be applied. 

I 2.1 Basic Counting Techniques— The Rule of Products 

WHAT IS COMBINATORICS? 

One of the first concepts our parents taught us was the "art of counting." We were taught to raise three fingers to indicate that we were three 
years old. The question of "how many" is a natural and frequently asked question. Combinatorics is the "art of counting." It is the study of 
techniques that will help us to count the number of objects in a set quickly. Highly sophisticated results can be obtained with this simple 
concept. The following examples will illustrate that many questions concerned with counting involve the same process. 

Example 2.1.1. A snack bar serves five different sandwiches and three different beverages. How many different lunches can a person 
order? One way of determining the number of possible lunches is by listing or enumerating all the possibilities. One systematic way of doing 
this is by means of a tree, as in Figure 2.1.1. 







HIkt 



IKW 









II 

fiwMr 
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Start 




15 Choices 



FIGURE 2.1.1 Tree solution for Example 2.1.1 

Every path that begins at the position labeled START and goes to the right can be interpreted as a choice of one of the five sandwiches 
followed by a choice of one of the three beverages. Note that considerable work is required to arrive at the number fifteen this way; but we 
also get more than just a number. The result is a complete list of all possible lunches. If we need to answer a question that starts with "How 
many . . . ," enumeration would be done only as a last resort. In a later chapter we will examine more enumeration techniques. 

An alternative method of solution for this example is to make the simple observation that there are five different choices for sandwiches and 
three different choices for beverages, so there are 5 • 3 = 15 different lunches that can be ordered. 

A listing of possible lunches a person could have is: {(BEEF, milk), (BEEF, juice), (BEEF, coffee), ..., (BOLOGNA, coffee)). 

Example 2.1.2. Let A = {a, b, c, d, e) and B = (1,2, 3). From Chapter 1 we know how to list the elements in 
A x B = {(a, 1), (a, 2), (a, 3), ..., (e, 3)). The reader is encouraged to compare Figure 2.1.1 for this example. Since the first entry of 
each pair can be any one of the five elements a, b, c, d, and e, and since the second can be any one of the three numbers 1 , 2, and 3, it is quite 
clear there are 5 • 3 = 15 different elements in A x B. 

Example 2.1.3. A person is to complete a true-false questionnaire consisting often questions. How many different ways are there to 
answer the questionnaire? Since each question can be answered either of two ways (true or false) , and there are a total of ten questions , there 
are 2-2-2-2-2-2-2-2-2-2 = 2 10 = 1024 different ways of answering the questionnaire. The reader is encouraged to visualize the tree 
diagram of this example, but not to draw it! 

We formalize the procedures developed in the previous examples with the following rule and its extension. 

THE RULE OF PRODUCTS 

Rule Of Products: If two operations must be performed, and If the first operation can always be performed p\ different ways and the second 
operation can always be performed p 2 different ways, then there are p\-p 2 different ways that the two operations can be performed. 

Note: It is important that p2 does not depend on the option that is chosen in the first operation. Another way of saying this is that p2 is indepen- 
dent of the first operation. If p 2 is dependent on the first operation, then the rule of products does not apply. 

Example 2.1.4. Assume in Example 2.1.1 that coffee is not served with a beef or chicken sandwich, then by inspection of Figure 2.1.1 
we see that there are only thirteen different choices for lunch. The rule of products does not apply, since the choice of beverage depends on 
one's choice of a sandwich. 
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Extended Rule Of Products. The rule of products can be extended to include sequences of more than two operations. If n operations must be 

performed, and the number of options for each operation is p\, P2, ■■■ , and p„, respectively, with each p t independent of previous choices, then 
the n operations can be performed 

n 
Pl-Pl Pn= n Pi 

1=1 

Example 2.1.5. A questionnaire contains four questions that have two possible answers and three questions with five possible answers. 
Since the answer to each question is independent of the answers to the other questions , the extended rule of products applies and there are 
2-2-2-2-5-5-5 = 2 4 5 3 = 2000 different ways to answer the questionnaire. In Chapter 1 we introduced the power set of a set A, P(A), 
which is the set of all subsets of A. Can we predict how many elements are in P(A) for a given finite set A? The answer is yes, and in fact 
| V (A) | = 2" . The ease with which we can prove this fact demonstrates the power and usefulness of the rule of products . Do not underesti- 
mate the usefulness of simple ideas. 

Theorem 2.1.1. If A is a finite set, then | P(A) | =2l A l . 

Proof: Let B e P(A) and assume \A\ = n. Then for each element x e A there are two choices, either leBofi £ B. Since there are n 
elements of A we have , by the rule of products , 2-2 2 = 2" different subsets of A. Therefore, | P(A) | = 2" ■ 

n factors 

EXERCISES FOR SECTION 2.1 
A Exercises 

1 . In horse racing, to bet the "daily double" is to select the winners of the first two races of the day. You win only if both selections are correct. 
In terms of the number of horses that are entered in the first two races, how many different daily double bets could be made? 

2. Professor Shortcut records his grades using only his students' first and last initials. What is the smallest class size that will definitely force 
Prof. S. to use a different system? 

3. A certain shirt comes in four sizes and six colors. One also has the choice of a dragon, an alligator, or no emblem on the pocket. How many 
different shirts could you order? 

4. A builder of modular homes would like to impress his potential customers with the variety of styles of his houses. For each house there are 
blueprints for three different living rooms, four different bedroom configurations, and two different garage styles. In addition, the outside can 
be finished in cedar shingles or brick. How many different houses can be designed from these plans? 

5. The Pi Mu Epsilon mathematics honorary society of Outstanding University wishes to have a picture taken of its six officers. There will be 
two rows of three people. How many different way can the six officers be arranged? 

6. An automobile dealer has several options available for each of three different packages of a particular model car: a choice of two styles of 
seats in three different colors , a choice of four different radios , and five different exteriors . How many choices of automobile does a customer 
have? 

7. A clothing manufacturer has put out a mix-and-match collection consisting of two blouses, two pairs of pants, a skirt, and a blazer. How 
many outfits can you make? Did you consider that the blazer is optional? How many outfits can you make if the manufacturer adds a sweater to 
the collection? 

8. As a freshman, suppose you had to take two of four lab science courses, one of two literature courses, two of three math courses, and one of 
seven physical education courses. Disregarding possible time conflicts, how many different schedules do you have to choose from? 

9. (a) Suppose each single character stored in a computer uses eight bits. Then each character is represented by a different sequence of eight 0's 
and l's called a bit pattern. How many different bit patterns are there? (That is, how many different characters could be represented?) 

(b) How many bit patterns are palindromes (the same backwards as forwards)? 

(c) How many different bit patterns have an even number of l's? 

10. Automobile license plates in Massachusetts usually consist of three digits followed by three letters. The first digit is never zero. How many 
different plates of this type could be made? 

11. (a) Let A = {a, b, c, d). Determine the number of different subsets of A. 
(b)LetA = {1, 2, 3, 4, 5). Determine the number of proper subsets of A. 

12. How many integers from 100 to 999 can be written with no 7's? 

13. Consider three persons, A, B, and C, who are to be seated in a row of three chairs. Suppose A and B are identical twins. How many seating 
arrangements of these persons can there be: 

(a) If you are a total stranger? 

(b) If you are A and B's mother? 

(This problem is designed to show you that different people can have different correct answers to the same problem.) 

14. How many ways can a student do a ten-question true-false exam if he or she can choose not to answer any number of questions? 

15. Suppose you have a choice of fish, lamb, or beef for a main course, a choice of peas or carrots for a vegetable, and a choice of pie, cake, or 
ice cream for dessert. If you must order one item from each category, how many different dinners are possible? 
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16. Suppose you have a choice of vanilla, chocolate, or strawberry for ice cream, a choice of peanuts or walnuts for chopped nuts, and a choice 
of hot fudge or marshmallow for topping. If you must order one item from each category, how many different sundaes are possible? 

B Exercises 

17. A questionnaire contains six questions each having yes-no answers. For each yes response, there is a follow-up question with four possible 
responses. 

(a) Draw a tree diagram that illustrates how many ways a single question in the questionnaire can be answered, 
(b) How many ways can the questionnaire be answered? 

18. Ten people are invited to a dinner party. How many ways are there of seating them at a round table? If the ten people consist of five men 
and five women, how many ways are there of seating them if each man must be surrounded by two women around the table? 

19. How many ways can you separate a set with n elements into two nonempty subsets if the order of the subsets is immaterial? What if the 
order of the subsets is important? 

20. A gardener has three flowering shrubs and four nonflowering shrubs. He must plant these shrubs in a row using an alternating pattern, that 
is, a shrub must be of a different type from that on either side. How many ways can he plant these shrubs? If he has to plant these shrubs in a 
circle using the same pattern, how many ways can he plant this circle? Note that one nonflowering shrub will be left out at the end. 
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I 2.2 Permutations 



A number of applications of the rule of products are of a specific type, and because of their frequent appearance they are given a designation all 
their own — permutations. Consider the following examples. 

Example 2.2.1. How many different ways can we order the three different elements of the set A = {a, b, c}? Since we have three choices 
for position one, two choices for position two, and one choice for the third position, we have, by the rule of products, 3-2-1 = 6 different ways 
of ordering the three letters . We illustrate through a tree diagram: 



>ii -i-tmw , 



Figure 2.2.1 

Each of the six orderings is called a permutation of the set A. 

Example 2.2.2. A student is taking five courses in the fall semester. How many different ways can the five courses be listed? There are 
5x4x3x2x1= 120 different permutations of the set of courses. 

In each of the above examples of the rule of products we observe that: 

(1) We are asked to order or arrange elements from a single set. 

(2) Each element is listed exactly once in each list (permutation). So if there are n choices for position one in a list, there are n - 1 choices for 
position two, n - 2 choices for position three, etc. 

Example 2.2.3. The alphabetical ordering of the players of a baseball team is one permutation of the set of players. Other orderings of 
the players' names might be done by batting average, age, or height. The information that determines the ordering is called the key. We would 
expect that each key would give a different permutation of the names. If there are twenty-five players on the team, there are 
25-24-23 2-1 different permutations of the players . 

We now develop notation that will be useful for permutation problems . 

Definition: Factorials. If "n is a positive integer then n factorial is the product of the first n positive integers and is denoted ni. Addition- 
ally, we define zero factorial to be 1 . 

0! = 1 

1! = 1 

2! =2-1=2 

3! = 3-2-1 =6 

n\ = n-(n- l)-(n-2) 2 • 1 = fl* 

k=\ 

Note that 4 ! is 4 times 3 !, or 24, and 5 ! is 5 times 4 !, or 120. In addition, note that as n grows in size, n ! grows extremely quickly. For 
example, 11 ! = 39916800. If the answer to a problem happens to be 25 !, for example, you would never be expected to write that number out 
completely. However, 

25' 

a problem with an answer of — - can be reduced to 25 • 24, or 600. 

r 23! 

So if | A | = n, there are n\ ways of permuting all n elements of A. We next consider the more general situation where we would like to 
permute k elements of a set of n objects, k < n. 

Example 2.2.4. A club of twenty-five members will hold an election for president, secretary, and treasurer in that order. Assume a 
person can hold only one position. How many ways are there of choosing these three officers? By the rule of products there are 25 • 24 • 23 
ways of making a selection. 

Definition: Permutation. An ordered arrangement ofk elements selected from a set of n elements, < k < n, where no two elements of 
the arrangement are the same, is called a permutation of n objects taken k at a time. The total number of such permutations is denoted by 
P(n; k). 

Theorem 2.2.1. The number of possible permutations of k elements taken from a set of n elements is 

P(n;k) = n-(n-l)-(n-2) {n-k+l)= -=*- = Z (n - f) 

( " '■ j=0 

Proof: Case I: li k = nwehaveP(«; ri) = — — = «!, which is simply the rule of products as applied in Example 2.2.3. 
Case U:lf0<k<n then, as in Example 2.2.4, we have k positions to fill with n elements and 
position 1 can be filled by any one of n elements 
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position 2 can be filled by any one of n - 1 elements 
position 3 can be filled by any one of n - 2 elements 

position k can be filled by any one of n - k + 1 elements. 
Hence, by the rule of products, P (n; k) = n -{n - \)-(n - 2) (n - k + l).Also, 

n\ _ nin-l)in-2) (n-k+l)in-k) 2-1 

(n-k)\ (n-k)\ 

= n-{n - 1) • O - 2) (n - k + 1) ■ 

It is important to note that the derivation of the permutation formula given in Theorem 2.2.1 was done solely through the rule of products. This 
serves to reiterate our introductory remarks in this section that permutation problems are really rule-of-products problems. Every permutation 
problem can be done by the rule of products. We close this section with several examples solved by both methods. 

Example 2.2.5. A club has eight members eligible to serve as president, vice-president, and treasurer. How many ways are there of 
choosing these officers? 

Solution 1: Using the rule of products. There are eight possible choices for the presidency, seven for the vice-presidency, and six for the 
office of treasurer. By the rule of products there are 8x7x6 = 336 ways of choosing these officers. 

Solution 2: Using the permutation formula. We want the total number of permutations of eight objects taken three at a time: 

P(8,3)=^= 8x7x6x5! =8-7-6 =336 

v ' (8-3)! 5! 

Example 2.2.6. Example 2.2.2 revisited. Solution 2: Using the permutation formula. We want the total number of permutations of five 
courses taken five at a time: 

P(5; 5) = -2—= -= 120 

v ' (5-5)! 0! 

Example 2.2.7. Consider the digits 1,2,3,4, and 5. 

(a) How many three-digit numbers can be formed if no repetition of digits can occur? 

(b) How many three-digit numbers can be formed if repetition of digits is allowed? 

Solution to (a): 

Solution 1 : Using the rule of products . We have any one of five choices for digit one , any one of four choices for digit two , and three choices 
for digit three. Hence, 5-4-3 = 60 different three-digit numbers can be formed. 

Solution 2; Using the permutation formula. We want the total number of permutations of five digits taken three at a time: 

51 5 Wiyly?y1 

P(5; 3) = -^— = - = 12L±2i£2ii2ii = 5x4x3 = 60 

(5-3)! 2! 2x1 

Solution to (b): The definition of permutation indicates ". . .no two elements in each list are the same." Hence the permutation formula cannot 
be used. However, the rule of products still applies. We have any one of five choices for the first digit, five choices for the second, and five 
for the third. So there are 5-5-5 = 125 possible different three-digit numbers if repetition is allowed. 

EXERCISES FOR SECTION 2.2 
A Exercises 

1 . If a raffle has three different prizes and there are 1 ,000 raffle tickets sold, how many different ways can the prizes be distributed? 

2. (a) How many three-digit numbers can be formed from the digits 1 , 2, 3 if no repetition of digits is allowed? List the three-digit numbers. 

(b) How many two-digit numbers can be formed if no repetition of digits is allowed? List them. 

(c) How many two-digit numbers can be obtained if repetition is allowed? 

3 . How many eight-letter words can be formed from the 26 letters in the alphabet? Even without concerning ourselves about whether the words 
make sense, there are two interpretations of this problem. Answer both. 

4. Let A be a set with \ A \ = n . 

(a) Determine | A 3 | . 

(b) Determine | {(a, b, c) e A 3 | each coordinate is different) |. 

5. The state finals of a high school track meet involves fifteen schools. How many ways can these schools be listed in the program? 

6. Consider the three-digit numbers that can be formed from the digits 1,2,3,4,5 with no repetition of digits allowed, 
(a) How many of these are even numbers? 
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(b) How many are greater than 250? 

7. (a) How many ways can the coach at Tall U. fill the five starting positions on a basketball team if each of his 15 players can play any 
position? (b) What is the answer if the center must be one of two people? 

8 . (a) How many ways can a gardener plant five different species of shrubs in a circle? 

(b) What is the answer if two of the shrubs are the same? 

(c) What is the answer if all the shrubs are identical? 

9. The president of the Math and Computer Club would like to arrange a meeting with six attendees, the president included. There will be three 
computer science majors and three math majors at the meeting. How many ways can the six people be seated at a circular table if the president 
does not want people with the same majors to sit next to each other? 

B Exercises 

10. Six people apply for three identical jobs and all are qualified for the positions. Two will work in New York and the other one will work in 
San Diego. How many ways can the positions be filled? 

11. (a) Let A = {1, 2, 3, 4). Determine the cardinality of j(ai, 02) eA 2 |ai£a2} 

(b) What is the answer to part (a) if | A | = n ? 

(c) Assume that A is a set with cardinality n. Determine the number of m-tuples in A m where each coordinate is different from the other 
coordinates. Break your answer down into cases m > n and m < n. 
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I 2.3 Partitions of Sets and the Laws of Addition 



One way of counting the number of students in your class would be to count the number in each row and to add these totals. Of course this 
problem is simple because there are no duplications, no person is sitting in two different rows. The basic counting technique that you used 
involves an extremely important first step, namely that of partitioning a set. The concept of a partition must be clearly understood before we 
proceed further. 

Definition: Partition. A partition of set A is a set of one or more nonempty subsets of A: A h A 2 , ... such that: 

(a) A, (J A 2 U ■■■ = A and 

(b) the subsets are mutually disjoint: that is, A,- f] Aj = for i # j. 

The subsets in a partition are often referred to as blocks. Note how our definition allows us to partition infinite sets, and to partition a set into an 
infinite number of subsets . Of course , if A is finite the number of subsets can be no larger than | A | . 

Example 2.3.1. Let A = {a, b, c, d\. Three partitions of A are: 

1. {{a}, {b}, {c, d\\ 

2. {{a, b), (c, d}} 

3. {{a), {b}, {c}, {d}} 

Example 2.3.2. Two examples of partitions of Z are {{«) | n e Z) and {{n \ n e Z, n < 0}, {0}, {n \ n e Z, n > 0)). The set of subsets 
{{n e Z | n > 0), [n e Z | n < 0)) is not a partition because the two subsets have a nonempty intersection. A second example of a non- 
partition is {{n e Z : \n\ = k) \ k = -1, 0, 1, 2, ...}. One of the blocks, {n e Z : | n | = -1} is empty. 

One could also think of the concept of partitioning a set as a "packaging problem." How can one "package" a carton of, say, twenty-four cans? 
We could use: four six-packs, three eight-packs, two twelve-packs, etc. In all cases: (a) the sum of all cans in all packs must be twenty-four, and 
(b) a can must be in one and only one pack. 

Basic Law Of Addition: If A is a finite set, and if {A t , A 2 , ..., A n ) is a partition of A, then 

|a| = | a, I +|a 2 I +••• + |a„ I = £ \A k \ 

The basic law of addition can be rephrased as follows: If A is a finite set where A = A t {J A 2 {J ■■■ {J A„ and where A, f] Aj = whenever 
i i= j, then 

|A|= \A 1 {jA 2 U-UA n \ = \A 1 \ + \A 2 \+- + \A n \ 

Example 2.3.3. The number of students in a class could be determined by adding the numbers of students who are freshmen, sopho- 
mores, juniors, and seniors, and those who belong to none of these categories. However, you probably couldn't add the students by major, 
since some students may have double majors . 

Example 2.3.4. The sophomore computer science majors were told they must take one and only one of the following courses, Cryptogra- 
phy, Data Structures, or Java script, in a given semester. The numbers in each course, respectively, for sophomore CS majors, were 75, 60, 
55. How many sophomore C.S. majors are there? The Law of Addition applies here. There are exactly 75 + 60 + 55 = 190 CS majors since 
the rosters of the three courses listed above would be a partition of the CS majors 

Example 2.3.5. It was determined that all sophomore computer science majors took at least one of the following courses: Cryptography, 
Data Structures, or Java script. Assume the number in each course was as in Example 2.3.4. Further investigation indicated ten of them took 
all three courses, twenty-five took Calculus and Data Structures, twelve took Calculus and Compiler Construction, and fifteen took Data 
Structures and Compiler Construction. How many sophomore C.S. majors are there? 

Example 2.3.4 is a simple application of the law of addition; in Example 2.3.5, however, some students are taking two or more courses, so a 
simple application of the law of addition would lead to double or triple counting. We rephrase Example 2.3.5 in the language of sets to 
describe the situation more explicitly if we let: 

A = the set of all sophomore computer science majors 

Aj = the set of all sophomore CS majors who took Cryptography 

A 2 = the set of all sophomore CS majors who took Data Structures 

A3 = the set of all sophomore CS majors who took javascript. 

Since all sophomore CS majors must take at least one of the courses, the number we want is: 

|A| = I A! UA 2 U A 3 I 

= |A 1 | + |A 2 | + |A 2 | -duplicates 

A Venn diagram is helpful to visualize the problem. In this case the universal set U can stand for any reasonable set, for example, the set of 
all students in the university. 
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Figure 2.3.1 

We see that the whole universal set is naturally partitioned into subsets that are labeled by the numbers 1 through 8, and the set A is partitioned 
into subsets labeled 1 through 7. Note also that students in the subsets labeled 2,3, and 4 are double counted, and those in the subset labeled 1 
are triple counted, in the sense that they have already been subtracted from the total twice. So 

I A | = | A l U A 2 U A 3 | 

= | Ai | + | A 2 | + | A 3 | -duplicates 

= j -Ai | + j A 2 [ + | -A 3 J -(duplicates - triplicates) 

= | A { | + | A 2 | + | A 3 | - | Ai n A 2 | -|AinA 3 |-|A 2 nA 3 | + |AinA 2 nA3| 
= 75 + 60 + 55 - 25 - 12 - 15 + 10 = 148 

Note that an alternate approach to this example could be to let A = U; in this case the subset labeled 8, namely (A] IJ A 2 U A 3 ) c , is the set of 
sophomore C. S. students who took none of the three courses, which is 0. The concepts discussed in this latest basic counting technique give 
rise to the following two formulas: 

Laws of Addition (Inclusion-Exclusion Laws) 

1. If Ai and A 2 are finite sets, then | A[ U A 2 | = | A[ | + | A 2 \ - | Ai f\ A% \ 

2. IfAi,A 2 , and A 3 are finite sets, then 

|Aj UA 2 U A 3 | = | A, | + | A 2 | + | A 3 | 

- 1 a l n a 2 i _ | Ai n a 3 i - 1 a 2 n a 3 i 
+ 1 Ai n a z n a 3 i 

In this section we saw that being able to partition a set into disjoint subsets gives rise to a handy counting technique. Given a set, there are many 
ways to partition depending on what one would wish to accomplish. One natural partitioning of sets is apparent when one draws a Venn 
diagram. This particular partitioning of a set will be discussed further in Chapters 4 and 13. 

EXERCISES FOR SECTION 2.3 
A Exercises 

1 . Find all partitions of the set A = {a, b, c). 

2. Which of the following collections of subsets of the plane, R x R, are partitions? 

(a) {{(x, y) | x + y = c\ | c e R) 

(b) The set of all circles in IR xR. 

(c) The set of all circles in IR x IR centered at (0, 0), together with ((0, 0)}. 
(d){{(x, y)} | (x, j)eRxR) 

3. A student, on an exam paper, defined the term partition the following way: "Let A be a set. A partition of A is any set of nonempty subsets 
A[ , A 2 , . . . , A„ of A such that each element of A is in one of the subsets A,." Is this definition correct? Why? 

4. Let Ai and A 2 be subsets of a set A. Draw a Venn diagram of this situation and shade in the subsets: Ai |"| A 2 , A\ f] A 2 , A) f] A 2 , and 
A] n A 2 . Use the resulting diagram and the definition of partition to convince yourself that subset of these four subsets that are nonempty form 
a partition of A. 

5. Show that \\2n \ neZ.), {2n + 1 | n e Z)( is a partition of Z. Describe this partition using only words. 

6. (a) A group of 30 students were surveyed and it was found that 18 of them took Calculus and 12 took C++. If all students took at least one 
course, how many took both Calculus and C++? Illustrate using a Venn diagram. 

(b) What is the answer to the question in part (a) if five students did not take either of the two courses? Illustrate using a Venn diagram. 

7. A survey of 90 people indicated that all of them participated in at least one of the following activities. In addition, 25 jogged and did 
aerobics, 20 jogged and worked out using Nautilus, 10 worked out in aerobics and Nautilus. How many of them were involved in all three 
activities if 30 were enrolled in each one of the activities? 
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8. A survey of 300 people indicated: 

60 owned an iPhone 75 owned an Blackberry, and 30 owned an Android. Further more, 40 owned both an iPhone and Blackberry, 12 owned 
both an iPhone and Android, and 8 owned a Blackberry and an Android. Finally, 3 owned all three phones 

(a) How many people surveyed owned none of the three phones? 

(b) How many people owned an Blackberry but not an iPhone? 

(c) How many owned a Blackberry but not an Android B Exercises 

9. (a) Use Inclusion-exclusion Law 1 to derive Law 2. Note, a knowledge of basic set laws is needed for this exercise, (b) State and derive the 
Inclusion-exclusion law for four sets. 

10. To complete your spring schedule, you must add Calculus and Physics. At 9:30, there are three Calculus sections and two Physics sections; 
while at 1 1:30, there are two Calculus sections and three Physics sections. How many ways can you complete your schedule if your only open 
periods are 9:30 and 11:30? 

C Exercise 

11. The definition of Q = {ajb \ a, ieZ, b t 0) given in Chapter 1 is at best awkward, since, if we use the definition to list elements in Q, 

1 -1 2 

we will have duplications, that is, -, — , - etc. Try to write a more precise definition of the rational numbers so that there is no duplication of 
elements . 
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I 2.4 Combinations and the Binomial Theorem 



COMBINATIONS 

In Section 2.1 we investigated the most basic concept in combinatorics, namely, the rule of products. Even though in this section we investigate 
other counting formulas, it is of paramount importance to keep this fundamental process in mind. In Section 2.2 we saw that a subclass of rule- 
of-products problems appears so frequently that we gave them a special designation, namely, permutations, and we derived a formula as a 
computational aid to assist us. In this section we will investigate another major subclass of the rule-of-product formula, namely, that given the 
name combinations. 

In many rule-of-products applications the permutation or order is important, as in the situation of the order of putting on one's socks and shoes; 
in some cases it is not important, as in placing coins in a vending machine or in the listing of the elements of a set. Order is important in 
permutations . Order is not important in combinations . 

Example 2.4.1. How many different ways are there to permute three letters from the set A = {a, b, c, d)l From the permutation 

4' 

formula of Section 2.2 there are P (4; 3) = — ■ — = 24 different orderings of three letters from A. 



Example 2.4.2. How many ways can we simply list, or choose, three letters from the set A = {a, b, c, d)1 Note here that we are not 
concerned with the order of the three letters. By trial and error, certainly abc, abd, acd, and bed are the only listings possible. A slightly more 
elegant approach would be to rephrase the question in terms of sets. What we are looking for is all three-element subsets of the set A. Order is 

not important in sets. The notation for choosing 3 elements from 4 is most commonly I I or occasionally C (4; 3), either of which is read "4 

choose 3" or the number of combinations for four objects taken three at a time. 

Definition: Binomial Coefficient. The binomial coefficient ( J ) r C(n; k) represents the number of combinations of n objects taken k 
at a time, and is read "n choose k." 

We would now like to investigate the relationship between permutation and combination problems in order to derive a formula for . Let us 

reconsider the above examples. There are 3 ! = 6 different orderings for each of the three-element subsets of A. Table 2.4.1 lists each subset of 
A and all permutations of each subset on the same line. 



3 - element subsets of A 


Permutations of each subset 


abc 


abc, acb, bca, bac, cab, cba 


abd 


abd, adb, bda, bad, dab, dba 


acd 


acd, adc, cda, cad, dac, dca 


bed 


bed, bde, cdb, cbd, dbc, deb 



Table 2.4.1 



Hence, 



- P(4; 3) = 4. We generalize this result in the following theorem: 



Theorem 2.4.1. If A is any finite set ofn elements, the number of k-element subsets of A is: 
where < k < n. 



(".)■ 



kl(n-k)! 

Proof: There are k ! ways of ordering each of the k objects in any set of k elements . Therefore 



- P(n; k) : 



(«-«! 



k< (n-k)\ 



Alternate Proof: To "construct" a permutation of k objects from a set of n elements, we can first choose one of the subsets of objects and 
second, choose one of the k! permutations of those objects. By the rule of products, 



Pin; k) 
and solving for 



k' kl 
we get 

- P(n; k) : 



k< (n-k)< 



Example 2.4.3. Assume an evenly balanced coin is tossed five times. In how many ways can three heads be obtained? This is a combination 
problem, because the order in which the heads appear does not matter. The number of ways to get three heads is 



31 (5-3)! 



= 10 



Example 2.4.4. Determine the total number of ways a fair coin can land if tossed five times. The four tosses can produce any one of the 
following mutually exclusive, disjoint events: 5 heads, 4 heads, 3 heads, 2 heads, 1 head, or heads. Hence by the law of addition we have: 
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Of course, we could also have applied the extended rule of products, and since there are two possible outcomes for each of the five tosses, we 

have 2x2x2x2x2x2= 2 5 = 32 ways. You might think that doing this counting two ways is a waste of time but solving a problem two 

« I n \ 
different ways often is instructive and leads to valuable insights. In this case, it suggests a general formula for the sum Z , -In the case of n 

k=o\ k ' 

= 5 , we get 2 5 , so it is reasonable to expect that the general sum is 2" , and it is . 

Example 2.4.5. A committee usually starts as an unstructured set of people selected from a larger membership. Therefore, a committee 

can be thought of as a combination. If a club of 25 members has a five-member social committee, there are = 53 130 different possible 

social committees. If any structure or restriction is placed on the way the social committee is to be selected, the number of possible commit- 
tees will probably change. For example, if the club has a rule that the treasurer must be on the social committee, then the number of possibili- 

/24\ 
ties is reduced to = 10626. If we further require that a chairperson other than the treasurer be selected for the social committee, we 

/ 24 \ rr . . c , / 24 \ . 

have x4 = 42504 different possible social committees. The choice of the four non-treasurers accounts for the and the choice of 

a chairperson accounts for the 4. 

Example 2.4.6. There is = 1 way of choosing a combination of zero elements from a set of n, and there is =1 wa Y of 

choosing a combination of n elements from a set of n. 

THE BINOMIAL THEOREM 

The binomial theorem gives us a formula for expanding (x + y)" where n is a nonnegative integer. The coefficients of this expansion are 
precisely the binomial coefficients that we have used to count combinations. From high school algebra we can certainly compute (x + y)" for 
n = 0, 1, 2, 3, 4, 5 as given in the following table: 

(x + y)° = 1 1 

(x + y) 1 = x + y 11 

(x + y) 2 = x 2 + 2 y x + y 2 12 1 

(x + y) 3 = x 3 + 3 y x 2 + 3 y 2 x + y 3 13 3 1 

(x + y) 4 = x 4 + 4 y x 3 + 6 y 2 x 2 + 4 y 3 x + y 4 14 6 4 1 

(x + y) 5 =x 5 +5yx 4 + 10y 2 x 3 + 10y 3 x 2 + 5 y 4 x + y 5 1 5 10 10 5 1 

TABLE 2.4.2 

In the expansion of (x + y) 5 we note that the coefficient of the third term is =10, and that of the sixth term is = 1 . We can rewrite 

the expansion as I jc 5 + \y x 4 + \y 2 x 3 + \y 3 x 2 + \y 4 x + I , \y 5 

In summary, in the expansion of (x + y) n we note: 

1 . The first term is x" and the last term is y" . 

2. With each successive term, exponents of x decrease by 1 as those of y increase by 1 . For any term the sum of the exponents is n. 

3. The coefficient of x n ~ k y k , the ( k + 1 ) st term, is 

4. The triangular array of numbers in Table 2.4.2 is called Pascal's triangle after the seventeenth-century French mathematician Blaise Pascal. 
Note that each number in the triangle other than the l's at the ends of each row is the sum of the two numbers to the right and left of it in the 
row above. 

Theorem 2.4.2: The Binomial Theorem. Ifn > 0, and x andy are numbers, then 

(x+y)"= if ")x"- k y k 
This theorem will be proven using a procedure called mathematical induction, which will be introduced in Chapter 3. 
Example 2.4.8. Find the third term in the expansion of (x - y) 4 . Since (x - y) 4 = (x + (-y)) 4 , the third term is 
v 4 - 2 (-j) 2 = 6x 2 y 3 . 



CH 
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Example 2.4.9. Expand (3 x — 2) . If we replace x and y in the Binomial Theorem with 3 x and -2, respectively you get 
I /3 



(3x-2) 3 = 2 (^)(3x) 3 -*(-2)* 



J ) (3 x) 3 (-2)° + ( J ) (3 x) 2 (-2) 1 + ( j ) -*) 1 (~2) 2 + (J)(3x)° (-2) 3 
= 27x 3 -54x 2 + 36x-8 

Mathematics Note 

Mathematica has a built-in function for binomial coefficients. Binomial. Unlike the examples we've concentrated on that can be done without 
technology, you can compute extremely large values. For example, a bridge hand is a 13 element subset of a standard 52 card deck — the order 

• , • , , , • „ • , , ,,-,.,,, ■ 1 52 

in which the cards come to the player doesn't matter. From the point of view of a single player, the number of possible bridge hands is 

which is easily computed with Mathematica: 
Binomial [52, 13] 

635013559600 

In bridge, the location of a hand in relation to the dealer has some bearing on the game. An even truer indication of the number of possible 
hands takes into account each player's possible hand. It is customary to refer to bridge positions as West, North, East and South. We can apply 

'52^ 



the rule of product to get the total number of bridge hands with the following logic. West can get any of the hands identified above. Then 

I 39 \ I 26 \ 

North get 13 of the remaining 39 cards and so has ,,, possible hands. East then get 13 of the 26 remaining cards, which has I ., I possibili- 
ties. South gets the remaining cards. Therefore the number of bridge hands is 

Binomial [52, 13] Binomial [39, 13] Binomial [26, 13] 

53 644 737 765 488 792 839 237 440 000 

Ifflf^ Sage Note 

Sage will do the same calculations for bridge hands just as easily. From a command line interface the calculations look like this: 

sage: binomial(52,13) 

635013559600 

sage: binomial(52,13)*binomial(39,13)*binomial(26,13) 

53644737765488792839237440000 

The syntax is different, but the results are the same. 

EXERCISES FOR SECTION 2.4 
A Exercises 

1. The judiciary committee at a college is made up of three faculty members and four students. If ten faculty members and 25 students have 
been nominated for the committee, how many judiciary committees are possible? 

2. Suppose that a single character is stored in a computer using eight bits. 

(a) How many bit patterns have exactly three 1 's? 

(b) How many bit patterns have at least two 1 's? 

3 . How many subsets of { 1 , .... 10} contain at least seven elements? 

4. The congressional committees on mathematics and computer science are made up of five congressmen each, and a congressional rule is that 
the two committees must be disjoint. If there are 385 members of congress, how many ways could the committees be selected? 

5. Expand (2x - 3y) 4 

6. Find the fourth term of the expansion of (x — 2 yf . 

7 '. (a) A poker game is played with 52 cards. How many "hands" of five cards are possible? 
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(b) If there are four people playing, how many five-card "hands" are possible on the first deal? 

8. A flush in a five-card poker hand is five cards of the same suit. How many spade flushes are possible in a 52-card deck? How many flushes 
are possible in any suit? 

9. How many five-card poker hands using 52 cards contain exactly two aces? 

10. In poker, a full house is three-of-a-kind and a pair in one hand; for example, three fives and two queens. How many full houses are 
possible from a 52- card deck? 

11. A class of twelve computer science students are to be divided into three groups of 3, 4, and 5 students to work on a project. How many 
ways can this be done if every student is to be in exactly one group? 

B Exercises 

12. Explain in words why the following equalities are true and then verify the equalities using the formula for binomial coefficients. 

w (J)-(»-t)' 0a * s -- 

13. There are ten points P\,P2, ■ ■ ■ , P n on a plane, no three on the same line. 

(a) How many lines are determined by the points? 

(b) How many triangles are determined by the points? 

14. How many ways can n persons be grouped into pairs when n is even? Assume the order of the pairs matters, but not the order within the 
pairs. For example, if n = 4, the different groupings would be 

{{1,2), (3,4)) {{3,4), {1,2)) 
{{1,3), {2,4)) {{2,4), {1,3)) 
{{1,4), {2, 3)) {{2,3), {1,4)) 

15. Use the binomial theorem to prove that if A is a finite set, | P(A) | = 2l A l . Hint: see Example 2.4.7. 

16. (a) A state's lottery involves choosing six different numbers out of a possible 36. How many ways can a person choose six numbers? 
(b) What is the probability of a person winning with one bet? 

17. Use the binomial theorem to calculate 9998 3 Note that 9998 3 = (10000 - 2) 3 . 

18. Suppose two gamblers are playing poker and are dealt five cards each. Use technology to determine the number of possible ways the hands 
could be dealt. Assume, as in bridge, the order of the hands matters. 
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SUPPLEMENTARY EXERCISES FOR CHAPTER 2 

Section 2.1 

1. A university would like to determine how many different three-digit telephone extensions can be made using the digits 
0,1,2,3,4,5,6,7,8,9 

(a) if the three digits must be different 

(b) if the three digits can include duplications. 

2. A typical telephone number in most larger cities involves a seven-digit number using the digits 0, 1, 2, ..., 9. The first three digits are 
referred to as the office code, and are frequently the same for smaller cities. 

(a) How many different telephone numbers can one create using only the last four digits? What if all four digits must be different? 

(b) How many different telephone numbers can be created using all seven digits? What if the first digit cannot be the number 0? 

3. (a) Let A = {a, b, c, d, ejandB = {1, 2, 3). Draw a tree diagram similar to Figure 2.1.1 to illustrate A x B. 
(b) Let A and B be sets where \A\ =m and |B| = n. Use a tree diagram to prove \AxB\= \A\ x |B|. 

4. In a business meeting involving four executives, each person shakes every other person's hand. How many handshakes will occur? 

5. How many ways can three couples be seated at a round table? (This problem may be harder than it seems.) 

(a) The first way to consider seating them is as each couple as a unit. 

(b) The second way is that the people in a couple may switch seats. How does this affect the seating plan? 

6. For the main course at a certain restaurant a person has the choice of beef (prepared three ways), chicken (fried or broiled), or any one of two 
fish dishes. Baked or french fried potatoes and a choice of three vege tables complement the main course. Red or white wine or water are the 
only beverages served. However, the owner is fussy, and he will serve red wine or water only with meat dishes, and white wine or water only 
with the fish dishes . How many meals can be ordered? 

Section 2.2 

7 . (a) How many ways are there to arrange six people in a circle? Two arrangements are to be considered the same if everyone has the same 
right-hand neighbor and the same left-hand neighbor regardless of the exact seat each person occupies. 

(b) How many ways are there to arrange six people in a circle if person p\ cannot sit next to person P2? 

8 . There are five roads from city A to city B and six roads from city B to city C . There are no direct routes from A to C . 

(a) How many different ways are there from city A to C? 

(b) How many different ways are there from A to C and back to A? 

(c) What is the answer to part b if each road is to be used exactly once? 

Section 2.3 

9. (a) A contest is entered by ten people. There will be four different prizes awarded among the ten people. No person will receive more than 
one prize. In how many ways can the prizes be distributed? 

(b) Suppose that instead of awarding four different prizes to the ten people it is decided that there will be four $10 prizes, three $7 prizes, 
and three $5 prizes. Suppose that each person will win a prize. In how many ways can the prizes be distributed? 

10. A real estate developer has finished constructing twelve houses and must paint them. He has purchased quantities of white and blue 
paint so that five of the houses will be blue and the remainder will be white. How many ways can he decide to paint the houses? 

1 1 . Ten persons apply for three identical jobs and all are qualified for the positions . Two of these persons will work in New York and the other 
one will work in San Diego. How many ways can the positions be filled? 

Section 2.4 

12. Let A = (1, 2, 3, ... , 10). 

(a) Determine the cardinality of V (A) . 

(b) How many subsets of A contain exactly four elements? 

(c) How many subsets of A contain at least four elements? 

13. Among eleven senators, how many ways are there to select (a) a committee of five members; (b) a committee of five members so that a 
particular senator, Senator A, is always included; (c) a committee of five so that either Senator A or Senator B will be included? 

t n \ I n 

14. Use the combination formula to prove that = 
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15. (a) How many ways can three people be seated in a car with four seats? Assume that someone must drive, 
(b) What is your answer if only two of the three people have a driver's license? 

16. (a) Let T = {1,2, 3, 4, 5). How many subsets of T have less than four elements? 

(b) How many proper subsets of {1,2, 3, 4, 5} contain the numbers 1 and 5? How many of them also do not contain the number 2? 
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chapter 3 




LOGIC 



GOALS 

In this chapter, we will introduce some of the basic concepts of mathematical logic. In order to fully understand some of the later concepts in 
this book, you must be able to recognize valid logical arguments. Although these arguments will usually be applied to mathematics, they 
employ the same techniques that are used by a lawyer in a courtroom or a physician examining a patient. An added reason for the importance of 
this chapter is that the circuits that make up digital computers are designed using the same algebra of propositions that we will be discussing. 
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I 3.1 Propositions and Logical Operators 



PROPOSITIONS 

Definition: Proposition. A proposition is a sentence to which one and only one of the terms true or false can be meaningfully applied. 

Example 3.1.1. "Four is even," "4 e {1, 3, 5)," and "43 > 21" are propositions. 

In traditional logic, a declarative statement with a definite truth value is considered a proposition. Although our ultimate aim is to discuss 
mathematical logic, we won't separate ourselves completely from the traditional setting. This is natural because the basic assumptions, or 
postulates, of mathematical logic are modeled after the logic we use in everyday speech. Since compound sentences are frequently used in 
everyday speech, we expect that logical propositions contain connectives like the word and. The statement "Europa supports life or Mars 
supports life" is a proposition and, hence, must have a definite truth value. Whatever that truth value is, it should be the same as the truth value 
of "Mars supports life or Europa supports life." 

LOGICAL OPERATORS 

There are several ways in which we commonly combine simple statements into compound ones. The words/phrases and, or, not, if... then, and 
if and only if can be added to one or more propositions to create a new proposition. To avoid any confusion, we will precisely define each one's 
meaning and introduce its standard symbol. With the exception of negation (not), all of the operators act on pairs of propositions. Since each 
proposition has two possible truth values, there are four ways that truth can be assigned to two propositions. In defining the effect that a logical 
operator has on two propositions, the result must be specified for all four cases. The most convenient way of doing this is with a truth table, 
which we will illustrate by defining the word and. 

Conjunction 

Definition: Conjunction (And). If p and q are propositions, their conjunction, p and q (denoted p f\ q), is defined by the truth table in 
Table 3.1.1. 

P 1 pAq 



1 

1 
1 1 1 

Table 3.1.1. Truth Table for And 

Notes: 

(a) To read this truth table, you must realize that any one line represents a case: one possible set of values for p and q. 

(b) The numbers and 1 are used to denote false and true, respectively. This is consistent with the way that many programming languages 
treat logical, or Boolean, variables since a single bit, or 1, can represent a truth value. Although Mathematical logical expressions have a 
value of True or False, there is a built in function called Boole which converts the value to 1 or 0, if desired. 

{Boole [False] , Boole [True]} 

(0,1) 

(c) For each case, the symbol under p represents the truth value of p. The same is true for q. The symbol under p l\q represents the truth value 
of p A q for that case. For example, the second row of the truth table represents the case in which p is false, q is true, and the resulting truth 
value for p f\ q is false. As in everyday speech, p /\ q is true only when both propositions are true. 

(d) Just as the letters x, y, and z are frequently used in algebra to represent numeric variables, p, q, and r seem to be the most commonly used 
symbols for logical variables. When we say that p is a logical variable, we mean that any proposition can take the place of p. 

(e) One final comment: The order in which we list the cases in a truth table is standardized in this book. If the truth table involves two simple 
propositions, the numbers under the simple propositions can be inter preted as the two-digit binary integers in increasing order, 00, 01, 10, and 
11, for 0,1, 2, and 3. 

Disjunction 

Definition: Disjunction (Or). If p and q are propositions, their disjunction is p or q, denoted p V q, and is defined by the truth table in 
Table 3.1 .2. 



p 


<7 


p\lq 














1 


1 


1 





1 


1 


1 


1 



Table 3.1.2. Truth Table for Or 
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Note; The only case in which disjunction is false is when both propositions are false. This interpretation of the word or is called the nonexclu- 
sive or. The exclusive or will be discussed when we consider logical design in Chapter 13. 

Negation 

Definition: Negation (Not). Ifp is a proposition, its negation, not p, is denoted -<p and is defined by the truth table in Table 3.1 3. 

p -,p 

1 

1 

Table 3.1.3 Truth Table for Not 

Note: Negation is the only standard operator that acts on a single proposition; hence only two cases are needed. 

The Conditional Operator (If. . . then). 

Consider the following propositions from everyday speech: 

(a) I'm going to quit if I don't get a raise. 

(b) If I pass the final, then I'll graduate. 

(c) I'll be going to the movies provided that my car starts. 

All three propositions are conditional, they can all be restated to fit into the form if Condition, then Conclusion. For example, statement (a) can 
be rewritten as "If I don't get a raise, then I'm going to quit." 

A conditional statement is meant to be interpreted as a guarantee; if the condition is true, then the conclusion is expected to be true. It says no 
more and no less. 

Definition: Conditional Operator. The conditional statement ifp then q, denoted p -> q, is defined by the truth table in Table 3.1.4. 

p q p^q 
o o 1 

1 1 

1 o o 
l l l 

Table 3.1.4 Truth Table for //... then 

Example 3.1.2. Assume your instructor told you "If you receive a grade of 95 or better in the final examination, then you will receive an 
A in this course." Your instructor has made a promise to you (placed a condition with you). If you fulfill his condition you expect the 
conclusion (getting an A) to be forthcoming. Your graded final has been returned to you. Has your instructor told the truth (kept the promise) 
or is your instructor guilty of a falsehood? 

Case I: Your final exam score was less than 95 (the condition is false) and you did not receive an A (the conclusion is false). The instructor 
told the truth. 

Case II: Your final exam score was less than 95, yet you received an A for the course. The instructor told the truth. (Perhaps your overall 
course average was excellent.) 

Case III: Your final exam score was greater than 95, but you did not receive an A. The instructor lied. 

Case IV: Your final exam score was greater than 95, and you received an A. The instructor told the truth. 

To sum up, the only case in which a conditional proposition is false is when the condition is true and the conclusion is false. 

The order of the condition and conclusion in a conditional proposition is important. If the condition and conclusion are exchanged, a different 
proposition is produced. 

Definition: Converse. The converse of the proposition p-> q is the proposition q ->p. 

The converse of "If you receive a grade of 95 or better in the final exam, then you will receive an A in this course," is "If you receive an A in 
this course, then you received a grade of 95 or better in the final exam." It should be clear that these two statements say different things. 

Definition: Biconditional Operator (...if and only if...). If p and q are propositions, the biconditional statement "p if and only if q," 
denoted p <-> q, is defined by the truth table in Table 3.1.5. 

P q p^q 

1 

1 

1 
1 1 1 

Table 3.1.5 Truth table for "... if and only if..." 

Note that p <-> q is true when p and q have the same truth values. It is common to abbreviate "if and only if to "iff." 
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Although "if . . . then" and "if and only if are frequently used in everyday speech, there are several alternate forms that you should be aware of. 
They are summarized in the following list: 

Conditional 

If p then q. 

p implies q. 

q follows from q. 

p, only if q. 

q,ifp. 

p is sufficient for q. 

q is necessary for p. 

Biconditional 

p if and only if q. 

p is necessary and sufficient for q. 

p is equivalent to q. 

If p, then q, and if q, then p. 

If p, then q and conversely. 

EXERCISES FOR SECTION 3.1 
A Exercises 

1. Let d = "I like discrete structures" c = "I will pass this course" s = "I will do my assignments" Express each of the following propositions in 
symbolic form: 

(a) I like discrete structures and I will pass this course. 

(b) I will do my assignments or I will not pass this course. 

(c) It is not true that I like discrete structures and I will do my assignments. 

(d) I will not do my assignment and I will not pass this course. 

2. For each of the following propositions, identify simple propositions, express the compound proposition in symbolic form, and determine 
whether it is true or false: 

(a) The world is flat or zero is an even integer. 

(b) If 432,802 is a multiple of 4, then 432,802 is even. 

(c) 5 is a prime number and 6 is not divisible by 4. 

(d) 3 e Z and 3 e Q. 
(e)2/3 e Zand 2/3 e Q. 

(f) The sum of two even integers is even and the sum of two odd integers is odd. 

3. Let p = "2 < 5," q = "8 is an even integer," and r = "11 is a prime number." Express the following as a statement in English and determine 
whether the statement is true or false: 

(a) - p V q 

(b) p^q 

(c) (p Aq)^ r 
{A)p^q\l (-. r) 

(e) p -» (-, q) V (-■ r) 
(f)^q^^p 

4. Rewrite each of the following statements using the other conditional forms: 
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(a) If an integer is a multiple of 4, then it is even. 

(b) The fact that a polygon is a square is a sufficient condition that it is a rectangle. 
(c)If x = 5, then* 2 = 25. 

(d)Ifx 2 - 5x + 6 = O.thenx = 2orx = 3. 
(e) x 2 = y 2 is a necessary condition for x = y. 
5 . Write the converse of the propositions in exercise 4. Compare the truth of each proposition and its converse. 
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I 3.2 Truth Tables and Propositions Generated by a Set 

Consider the compound proposition c = (p A <7) V C -1 1 A '"). where p, q, and r are propositions. This is an example of a proposition gener- 
ated by p, q, and r. We will define this terminology later in the section. Since each of the three simple propositions has two possible truth 
values, it follows that there are eight different combinations of truth values that determine a value for c. These values can be obtained from a 
truth table for c. To construct the truth table, we build c from p, q, and r and from the logical operators. The result is Table 3.2.1. Strictly 
speaking, the first three columns and the last column make up the truth table for c. The other columns are work space needed to build up to c. 

P 1 r pAq ~-q ->qf\r {p /\q)\j ' (-. q /\r) 









1 








1 





1 


1 


1 


1 














1 1 














1 





1 








1 1 





1 


1 


1 


1 1 


1 








1 


1 1 1 


1 








1 



Table 3.2.1 Truth Table for c = (p A q) V (-> 1 A r) 

Note that the first three columns of the truth table are an enumeration of the eight three-digit binary integers. This standardizes the order in 
which the cases are listed. In general, if c is generated by n simple propositions, then the truth table for c will have 2" rows with the first n 
columns being an enumeration of the n digit binary integers. In our example, we can see at a glance that for exactly four of the eight cases, c 
will be true. For example, if p and r are true and q is false (the sixth case), then c is true. 

Let S be any set of propositions . We will give two definitions of a proposition generated by S. The first is a bit imprecise, but should be clear. 
The second definition is called a recursive definition. If you find it confusing, use the first definition and return to the second later. 

Definition: Proposition Generated by S. 

(1) A proposition generated by S is any valid combination of propositions in S with conjunction, disjunction, and negation. 

(2) (a) Ifp E S, then p is a proposition generated by S. 

(b) If x and y are propositions generated by S, then so are (x), -i x, x V y , and x f\y. 

Note: We have not included the conditional and biconditional in the definition because they can both be obtained from conjunction, disjunction, 
and negation, as we will see later. 

If 5 is a finite set, then we may use slightly different terminology. For example, if S = {p, q, r), we might say that a proposition is generated 
by p, q, and r instead of {p, q, r). 

Hierarchy of Logical Operations 

// is customary to use the following hierarchy for interpreting propositions, with parentheses overriding this order: 
First: Negation 
Second: Conjunction 
Third: Disjunction 

Within any level of the hierarchy, work from left to right. Using these rules, p A q V '" is taken to mean (p /\ q) /\ r. These precedence rules 
are universal, and are exactly those used by computer languages to interpret logical expressions. 

Example 3.2.1. A few shortened expressions and their fully parenthesized versions: 

(a) p A q A r is (p A q) A r. 

(b) -pV-nsbp)V (-T). 
(c) -. -. p is -. (-i p). 

A proposition generated by a set S need not include each element of S in its expression. For example, -> q A r is a proposition generated by p, 
q, and r. 

EXERCISES FOR SECTION 3.2 
A Exercises 

1 . Construct the truth tables of: 

(a) p V p 

(b) p A (- P) 

(c)pV(-p) 
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(d) p A p 

2. Construct the truth tables of: 

(a) -(M<7) 

(b) P A (-■ q) 

(c) (p Aq)Ar 

(d) (pA ? )\/(?A/')V(fA p) 

(e) (-■ p) V - q) 

(f) P V?VrVs 

3. Rewrite the following with as few extraneous parentheses as possible: 

(a) (-- dp) A ('•))) V (s) 

(b) ((p) V (9)) A «r) V («)) 

4. In what order are the operations in the following propositions performed? 

(a) p V - q V r A - p 

(b) P A - 9 A r A - P 

5. Determine the number of rows in the truth table of a proposition containing four variables p, q, r, and.?. 

6. If there are 45 lines on a sheet of paper, and you want to reserve one line for each line in a truth table, how large could | S \ be if you can 
write truth tables of propositions generated by S on a sheet of paper? 
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I 3.3 Equivalence and Implication 



Tautologies & Contradictions 

Consider two propositions generated by p and q: -i (p f\ q) and -i p V ~< q- At first glance, they are different propositions. In form, they are 
different, but they have the same meaning. One way to see this is to substitute actual propositions forp and q; such as: 

p: I've been to Toronto; and q: I've been to Chicago. 

Then -i (p A q) translates to "I haven't been to both Toronto and Chicago," while -> p V -i q. is "I haven't been to Toronto or I haven't been to 
Chicago." Determine the truth values of these propositions. Naturally, they will be true for some people and false for others. What is important 
is that no matter what truth values they have, -i (p A q) and ->p V "■ q- will have the same truth value. The easiest way to see this is by 
examining the truth tables of these propositions () . 

p i ->(Mtf) -"pV- 1 ? 

1 1 

Oil 1 

10 1 1 

110 

Table 3.3.1 Truth tables of -. (p A q) and -. p V -> <7- 

In all four cases, -i (p A <f) and "■ P V "" q- have the same truth value. Then when the biconditional operator is applied to them, the result is a 
value of true in all cases . 

Definition: Tautology. An expression involving logical variables that is true in all cases is called a tautology. 
Example 3.3.1. All of the following are tautologies because their truth tables consist of a column of l's. 

(a) (-.(p A ? ))H(- ip V -•«). 

(b) p V - p 

(c) (p A q) -» P 

(d) q^(pVq) 

(e) ( P V?)h( ? V P) 

Definition: Contradiction. An expression involving logical variables that is false for all cases is called a contradiction. 

Example 3.3.2. p A "> P and (p V q) A ("■ P) A ("■ q) are contradictions. 
Equivalence 

Definition: Equivalence. Let S be a set of propositions and let r and s be propositions generated by S. r and s are equivalent if and only 
if r <-> s is a tautology. The equivalence ofr and s is denoted r <=> s. 

Example 3.3.3. The following are all equivalences: 

(a) (pA?)V(-pA?)« q. 

(b) p -» q <=$ ->q -» ~>p 

(c) p V q^q\l P- 

All tautologies are equivalent to one another. We will use the number 1 to symbolize a tautology. 

Example 3.3.4. p V "> P <=^> I- 
All contradictions are equivalent to one another. We will use the number to symbolize a contradiction. 

Example 3.3.5. p A -< p <=^> 0. 

Equivalence is to logic what equality is to algebra. Just as there are many ways of writing an algebraic expression, the same logical meaning 
can be expressed in many different ways. 

IMPLICATION 

Example 3.3.6. Consider the two propositions: 

x: The money is behind Door A; and 

y: The money is behind Door A or Door B . 
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Imagine that you were told that there is a large sum of money behind one of two doors marked A and B , and that one of the two propositions 
x and y is true and the other is false. Which door would you choose? All that you need to realize is that if x is true, then y will also be true. 
Since we know that this can't be the case, y must be the true proposition and the money is behind Door B . 

This is an example of a situation in which the truth of one proposition leads to the truth of another. Certainly, y can be true when x is false; but x 
can't be true when y is false. In this case, we say that x implies y. 

Look at the truth table of p -» q in Table 3.1.4 (copied below). If p implies q, then the third case can be ruled out, since it is the case that makes 
a conditional proposition false. 

p q p^q 
o o 1 

1 1 

1 

1 1 1 

Definition: Implication. Let S be a set of propositions and let r and s be propositions generated by S. We say that r implies s ifr -> s is a 
tautology. We write r => s to indicate this implication. 

Example 3.3.7. A commonly used implication is thatp implies p \J q, which is verified by the truth table in Table 3.3.2. 

P q pVq P^pVq 









1 


1 


1 


1 


1 


1 


1 


1 1 


1 


1 



Table 3.3.2 Truth Table for p -> (p V q) 

If we letp represent "The money is behind Door A" and q represent "The money is behind Door B," p => (p V <?) is a formalized version of 
the reasoning used in Example 3.3.6. A common name for this implication is disjunctive addition. In the next section we will consider some 
of the most commonly used implications and equivalences . 

When we defined what we mean by a proposition generated by a set in Section 3.2, we didn't include the conditional and biconditional 
operators. This was because of the two equivalences p^>q<^>^p\Jq and p «-» q «=> (p A q) V C -1 P A -l <?)• Therefore, any proposi- 
tion that includes the conditional or biconditional operators can be written in an equivalent way using only conjunction, disjunction, and 
negation. We could even dispense with disjunction since p \/ q is equivalent to a proposition that uses only conjunction and negation. 

EXERCISES FOR SECTION 3.3 
A Exercises 

1 . Given the following propositions generated by p, q, and r, which are equivalent to one another? 

(a) (p A r) V q 

(b) p\l{r\/q) 

(c) r A p 

(d) -. r V P 

(e) (/>V<?)A(rV<7) 

(f) r -» p 

(g) r\f ->p 

(h) p -» r 

2. (a) Construct the truth table for x = (p A ->q) V (r A p)- 

(b) Give an example other than x itself of a proposition generated by p, q, and r that is equivalent to x. 

(c) Give an example of a proposition other than x that implies x. 

(d) Give an example of a proposition other than x that is implied by x. 

3. Is an implication equivalent to its converse? Verify your answer using a truth table. 

4. Suppose that x is a proposition generated by p, q, and r that is equivalent to p V ~<q- Write out the truth table for x. 

B Exercises 

5 . How large is the largest set of propositions generated by p and q with the property that no two elements are equivalent? 
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6. Find a proposition that is equivalent to p V <7 an d uses on ly conjunction and negation. 

7. Explain why a contradiction implies any proposition and any proposition implies a tautology. 
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I 3.4 The Laws of Logic 



In this section, we will list the most basic equivalences and implications of logic. Most of the equivalences listed in Table 3.4.1 should be 
obvious to the reader. Remember, stands for contradiction, 1 for tautology. Many logical laws are similar to algebraic laws. For example, 
there is a logical law corresponding to the associative law of addition, a + (b + c) = (a + b) + c. In fact, associativity of both conjunction 
and disjunction are among the laws of logic. Notice that with one exception, the laws are paired in such a way that exchanging the symbols A, 
V, 1 and for V, A, 0, and 1, respectively, in any law gives you a second law. For example, p V <=> p results in p f\ 1 <=> p. This called a 
duality principle. For now, think of it as a way of remembering two laws for the price of one. We will leave it to the reader to verify a few of 
these laws with truth tables. However, the reader should be careful in applying duality to the conditional operator and implication since the dual 
involves taking the converse. For example, the dual of p f\ g => p is p V <? <= P> which is usually written p=)p V 1 
Example 3.4.1. The identity law: 

P 1 pAi (pAi)^p 
10 1 

111 1 

therefore, (p A <=> P- 

Some of the logical laws in Table 3.4.2 might be less obvious to you. For any that you are not comfortable with, substitute actual propositions 
for the logical variables. For example, if p is "John owns a pet store" and q is "John likes pets," the detachment law should make sense. 

TABLE 3.4.1 Basic Logical Laws 





Commutative Laws 






p\J qdq\J p p /\q ^q Ap 






Associative Laws 




(p V q)\/r O p V («V '•) (P A q) A ' 


•«pA(?Ar) 




Distributive Laws 




p A (q V r) ^ 


(J> A q) V (P A r) pV(?Ar)« 


(P V g) A (p V r) 




Identity Laws 






p V 0<=>p p A 1 « 


P 




Negation Laws 






p A->p«*o pV-p^i 






Idempotent Laws 






pVpwp pAp«p 






Null Laws 






pAO«o pVi<«i 






Absorption Laws 






p AlpV <?)<=> p p V (p A q) <=> 


P 




DeMorgan's Laws 




-(pV«) 


' <=> (-> p) A (-> q) - (p A q) <=> ( 


- P) V (-> <?) 



Involution Law 
-. (-. p) <=> p 
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TABLE 3.4.2 Common Implications and Equivalences 



Detachment 


(p ^ q) A p ^ q 


Indirect Reasoning 


(p^<?)A-"7=>-'P 


Disjunctive Addition 


p^>(p\l q) 


Conjunctive Simplification 


A q) => p and (p A q) =* q 


Disjunctive Simplification 


(p V q) A -> p => q and (p \/ 9) A "" q =» p 


Chain Rule 


(p -» ?) A (tf -» r) =* (p -» '") 


Conditional Equivalence 


p->?«-./)V? 


Biconditional Equivalences 


(p <-» q) <^> (p -> <?) A (g -» p) <=> (p A 9) V (-> P A - ?) 



Contrapositive 
(P -> ?) ** (-> 9 -> "" P) 

EXERCISES FOR SECTION 3.4 
A Exercises 

1 . Write the following in symbolic notation and determine whether it is a tautology: "If I study then I will learn. I will not learn. Therefore, I do 
not study." 

2. Show that the common fallacy (p -» q) f\ -1 p => -1 q is not a law of logic. 

3. Describe in general how duality can be applied to implications if we introduce the symbol <=, read "is implied by." 

4. Write the dual of the following statements: 

(a) (p A q) => P 

(b) (p V q) A - 9 => P 
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I 3.5 Mathematical Systems 



In this section, we present an overview of what a mathematical system is and how logic plays an important role in one. The axiomatic method 
that we will use here will not be duplicated with as much formality anywhere else in the book, but we hope an emphasis on how mathematical 
facts are developed and organized will help to unify the concepts we will present. The system of propositions and logical operators we have 
developed will serve as a model for our discussion. Roughly, a mathematical system can be defined as follows: 

Definition: Mathematical System. A mathematical system consists of: 

(1) A set or universe, U. 

(2) Definitions — sentences that explain the meaning of concepts that relate to the universe. Any term used in describing the universe itself is 
said to be undefined. All definitions are given in terms of these undefined concepts of objects. 

(3) Axioms — assertions about the properties of the universe and rules for creating and justifying more assertions. These rules always 
include the system of logic that we have developed to this point. 

(4) Theorems — the additional assertions mentioned above. 

Example 3.5.1. In Euclidean geometry the universe consists of points and lines (two undefined terms). Among the definitions is a 
definition of parallel lines and among the axioms is the axiom that two distinct parallel lines never meet. 

Example 3.5.2. Propositional calculus is a formal name for the logical system that we've been discussing. The universe consists of 
propositions. The axioms are the truth tables for the logical operators and the key definitions are those of equivalence and implication. We 
use propositions to describe any other mathematical system; therefore, this is the minimum amount of structure that a mathematical system 
can have. 

Definition: Theorem. A true proposition derived from axioms of mathematical system is called a theorem. 

Theorems are normally expressed in terms of a finite number of propositions , p\. pi. . . . , p„ , called the premises, and a proposition, C, called 
the conclusion. These theorems take the form 

Pi A piA ■■■ Ap„^c 

or more informally, 

Pi, P2, ..., and p„ imply C 

For a theorem of this type, we say that the premises imply the conclusion. When a theorem is stated, it is assumed that the axioms of the system 
are true. In addition, any previously proven theorem can be considered an extension of the axioms and can be used in demonstrating that the 
new theorem is true. When the proof is complete, the new theorem can be used to prove subsequent theorems. A mathematical system can be 
visualized as an inverted pyramid with the axioms at the base and the theorems expanding out in various directions (Figure 3.5.1). 




FIGURE 3.5.1 The body of knowledge In a mathematical system 

PROOF 

Definition: Proof. A proof of a theorem is a finite sequence of logically valid steps that demonstrate that the premises of a theorem imply 
the conclusion . 

Exactly what constitutes a proof is not always clear. For example, a research mathematician might require only a few steps to prove a theorem 
to a colleague, but might take an hour to give an effective proof to a class of students. Therefore, what constitutes a proof often depends on the 
audience. But the audience is not the only factor. One of the most famous theorems in graph theory, The Four Color Theorem, was finally 
proven in 1976, after over a century of effort by many mathematicians. Part of the proof consisted of having a computer check many different 
graphs for a certain property. Without the aid of the computer, this checking would have taken years. In the eyes of some mathematicians, this 
proof was considered questionable. Shorter proofs have been developed since 1976 and there is no controversy associated with The Four Color 
Theorem at this time. (The theorem is stated in Chapter 9.) 

PROOFS IN PROPOSITIONAL CALCULUS 

Theoretically, you can prove anything in propositional calculus with truth tables. In fact, the laws of logic stated in Section 5.4 are all theorems. 
Propositional calculus is one of the few mathematical systems for which any valid sentence can be determined true or false by mechanical 
means. A program to write truth tables is not too difficult to write; however, what can be done theoretically is not always practical. For example, 

a, a -» b, b -» c, . . . , y -> z =* Z 
is a theorem in propositional calculus. However, suppose that you wrote such a program and you had it write the truth table for 

(a A (a-> *)A(*-> c)A- A(y-»z))->z 
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The truth table will have 2 2S cases. At one million cases per second, it would take approximately one minute to verify the theorem. Now if you 
decided to check a similar theorem, 

Pi, Pi ~>P2, ■■■ ,P99^Pl00 =*PlOO, 

you would really have time trouble. There would be 2 100 as 1.26765 x 10 30 cases to check in the truth table. At one million cases per second it 
would take approximately 1.467 19 x 10 19 days to check all cases. For most of the remainder of this section, we will discuss an alternate method 
for proving theorems in propositional calculus. It is the same method that we will use in a less formal way for proofs in other systems. Formal 
axiomatic methods would be too unwieldy to actually use in later sections. However, none of the theorems in later chapters would be stated if 
they couldn't be proven by the axiomatic method. 

We will introduce two types of proof here, direct and indirect. 

DIRECT PROOFS 

A direct proof is a proof in which the truth of the premises of a theorem are shown to directly imply the truth of the theorem's conclusion. 

Example 3.5.3. Theorem: p -» r, q -» s, p\J q=$ sV r. A direct proof of this theorem is: 



Step 


Proposition 


Justification 


(1) 


PV 1 


Premise 


(2) 


->p -» q 


(1), conditional rule 


(3) 


q -» s 


Premise 


(4) 


-i p -» s 


(2), (3), chain rule 


(5) 


-i s -» p 


(4) , contrapositive 


(6) 


p -> r 


Premise 


(7) 


-i s -* r 


(5), (6), chain rule 


(8) 


s Mr 


(7), conditional rule 



Note that ■ marks the end of a proof. 

Rules for Formal Proofs. Example 3.5.3 illustrates the usual method of formal proof in a formal mathematical system. The rules governing 
these proofs are: 

(1) A proof must end in a finite number of steps. 

(2) Each step must be either a premise or a proposition that is implied from previous steps using any valid equivalence or implication. 

(3) For a direct proof , the last step must be the conclusion of the theorem. For an indirect proof (see below), the last step must be a 
contradiction. 

(4) Justification Column. The column labeled "justification" is analogous to the comments that appear in most good computer programs. They 
simply make the proof more readable. 

Example 3.5.4. Here are two direct proofs of -i p \J q, s\J p, -i q => s: 

(1) -i p V q Premise 

(2) -i q Premise 

(3) -i p Disjunctive simplification, (1), (2) 

(4) s\J p Premise 

(5) s Disjunctive simplification, (3), (4). ■ 
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You are invited to justify the steps in this second proof: 

(1) -np \J q 

(2) -iq -> -ip 

(3) Wp 

(4) p V * 

(5) -p^ ,v 

(6) -i 4 -» .s 

(7) -■« 

(8) s m 

CONDITIONAL CONCLUSIONS 

The conclusion of a theorem is often a conditional proposition. The condition of the conclusion can be included as a premise in the proof of the 
theorem. The object of the proof is then to prove the consequence of the conclusion. This rule is justified by the logical law 
p -» (h -» c) <s=> (p A h) -» c. 
Example 3.5.5. The following proof of p ->(q->s),-rVp,q=>r->s includes r as a fourth premise. The truth of s concludes the proof . 



(1) 


-i r V p 


Premise 


(2) 


r 


Added premise 


(3) 


P 


(1), (2), disjunction simplification 


(4) 


p -» (q -*s) 


Premise 


(5) 


q -» .s 


(3), (4), detachment 


(6) 


<7 


Premise 


(7) 


.V 


(5), (6), detachment. ■ 



INDIRECT PROOFS/ PROOF BY CONTRADICTION 

Consider a theorem P => C, where P represents p ly pi, ■ ■ ■ , and p„ , the premises. The method of indirect proof is based on the equivalence 
P^Cd^(PA -■ Q. 

In words, this logical law states that if P => C, then P A "> C is always false; that is, P A "■ C is a contradiction. This means that a valid 
method of proof is to negate the conclusion of a theorem and add this negation to the premises. If a contradiction can be implied from this set of 
propositions, the proof is complete. For the proofs in this section, a contradiction will often take the form f A -if. For proofs involving 
numbers, a contradiction might be 1 = or < 0. Indirect proofs involving sets might conclude with x e or (x e A and x e A c ). Indirect 
proofs are often more convenient than direct proofs in certain situations. Indirect proofs are often called proof s by contradiction. 

Example 3.5.6. Here is an example of an indirect proof of the theorem in Example 3.5.3: 



(1) 


- (s V r) 


Negated conclusion 




(2) 


-i s A -i r 


DeM organ's Law, (1) 




(3) 


-i s 


Conjunctive simplification, 


(2) 


(4) 


q -> s 


Premise 




(5) 


-■« 


Indirect reasoning, (3), (4) 




(6) 


-i r 


Conjunctive simplification, 


(2) 


(7) 


p -» r 


Premise 




(8) 


~T 


Indirect reasoning, (6), (7) 




(9) 


(- P) A (-■ q) 


Conjunctive, (5), (8) 




(10) 


- (P V q) 


DeMorgan's Law, (9) 




(11) 


P V q 


Premise 




(12) 





(10),(11)B 
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PROOF STYLE 

The rules allow you to list the premises of a theorem immediately; however, a proof is much easier to follow if the premises are only listed 
when they are needed. 



Example 3.5.7. Here is an indirect proof of a -» b, -> (b \/ c) 



(1) 


a 


Negation of the conclusk 


(2) 


a -> b 


Premise 


(3) 


b 


(1), (2), detachment 


(4) 


b\J c 


(3), disjunctive addition 


(5) 


-> (* V c) 


Premise 


(6) 





(4),(5)» 



As we mentioned at the outset of this section, we are only presenting an overview of what a mathematical system is. For greater detail on 
axiomatic theories, see Stoll (1961). An excellent description of how prepositional calculus plays a part in artificial intelligence is contained in 
Hofstadter (1980). If you enjoy the challenge of constructing proofs in propositional calculus, you should enjoy the game WFF'N PROOF 
(1962), by L. E.Allen. 

EXERCISES FOR SECTION 3.5 
A Exercises 

1 . Prove with truth tables: 

(a) p V q, -■ q => P 

(b) p -» q, ->q => ->p 

2. Prove with truth tables: 

(a) q, -i q =*• p 

(b)p^?=»-pV? 
B Exercises 

3. Give direct and indirect proofs of: 

(a) a -> b, c -> b, d -» (a V c), d => b. 

(b) <J>->q) A (r -» s), (^ -» A Gs -» k), -> (f A «), /? -» r =» -./?. 

(c) p -» (^ -» i), -i i V p, q =$ s -» r. 

(d) p -» 9, 9 -» r, -i (p A '"), p V r => '■■ 

(e) -i g, p -» g, pV( => ? 

4. Give direct and indirect proofs of: 

(a) p -» g, -i r -» -i g, -i r =» -ip. 

(b) p -» -i g, -i r -» g, p => r. 

(c) a \/ b, c /\ d, a -> -i c =$ b. 

5 . Are the following arguments valid? If they are valid, construct formal proofs; if they aren't valid, explain why not. 

(a) If wages increase, then there will be inflation. The cost of living will not increase if there is no inflation. Wages will increase. Therefore, the 
cost of living will increase. 

(b) If the races are fixed or the casinos are crooked, then the tourist trade will decline. If the tourist trade decreases, then the police will be 
happy. The police force is never happy. Therefore, the races are not fixed. 

6. Determine the validity of the following argument: For students to do well in a discrete mathematics course, it is necessary that they study 
hard. Students who do well in courses do not skip classes. Students who study hard do well in courses. Therefore students who do well in a 
discrete mathematics course do not skip class. 

7. Describe how pi, pi -* pi, ... , P99 -» P100 => Pioo could be proven in 199 steps. 
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I 3.6 Propositions over a Universe 



Example 3.6.1. Consider the sentence "He was a member of the Boston Red Sox." There is no way that we can assign a truth value to 
this sentence unless "he" is specified. For that reason, we would not consider it a proposition. However, "he" can be considered a variable that 
holds a place for any name. We might want to restrict the value of "he" to all names in the major-league baseball record books. If that is the 
case, we say that the sentence is a proposition over the set of major-league baseball players, past and present. 

Definition: Proposition over a Universe. Let U be a nonempty set. A proposition over U is a sentence that contains a variable that can take on 
any value in U and that has a definite truth value as a result of any such substitution. 

Example 3.6.2. 

(a) A few propositions over the integers are 4 x 2 -3x = Q,0<n < 5, and "k is a multiple of 3." 

(b) A few propositions over the rational numbers are 4 x 2 — 3 x = 0, y 2 = 2 , and (s - 1) (s + 1) = s 2 - 1. 

(c) A few propositions over the subsets of IP are (A = 0) V (A = P),3 e A, and A f| {1, 2, 3} * 0. 

All of the laws of logic that we listed in Section 3.4 are valid for propositions over a universe. For example, if p and q are propositions over the 
integers, we can be certain that p A 1 => p, because (p A q) -> P is a tautology and is true no matter what values the variables in p and q are 
given. If we specify p and q to be p (n) : n < 4 and q (n) : n < 8, we can also say that p implies p /\ q. This is not a usual implication, but for 
the propositions under discussion, it is true. One way of describing this situation in general is with truth sets. 

TRUTH SETS 

Definition: Truth Set. If p is a proposition over U, the truth set ofp is T p = [a e U \ p(a) is true). 

Example 3.6.3. The truth set of the proposition {1, 2) f]A = taken as a proposition over the power set of {1, 2, 3, 4) is 
(0, (3), (4), (3, 4)). 

Example 3.6.4. In the universe Z (the integers), the truth set of Ax 2 - 3x = is (0). If the universe is expanded to the rational numbers, 
the truth set becomes {0, 3/4). The term solution set is often used for the truth set of an equation such as the one in this example. 

Definition: Tautology and Contradiction. A proposition over U is a tautology if its truth set is U. It is a contradiction if its truth set is 
empty. 

Example 3.6.5. (s - \)(s + 1) = s 2 - 1 is a tautology over the rational numbers, x 2 - 2 = is a contradiction over the rationals . 
The truth sets of compound propositions can be expressed in terms of the truth sets of simple propositions. For example, if a e T p i\ q , then a 
makes p l\ q true. Therefore, a makes both p and q true, which means that a e T p f]T q . This explains why the truth set of the conjunction of 
two propositions equals the intersection of the truth sets of the two propositions. The following list summarizes the connection between 
compound and simple truth sets: 

T p /\q = T P f]T q 
Tp\lq = T p U T q 

^-(^nrjuivnr,') 

T = T c I I T 

± p->q p ^-J q 

Definition: Equivalence. Two propositions are equivalent if p <-> q is a tautology. In terms of truth sets, this means that p and q are 
equivalent if T p =T q . 

Example 3.6.6. 

(a) n + 4 = 9 and n = 5 are equivalent propositions over the integers . 

(b) A P| (4) =A and 4 e A are equivalent propositions over the power set of the natural numbers. 
Definition: Implication. If p and q are propositions over U, p implies q ifp -* q is a tautology. 

Since the truth set of p -» q is T p IJ T„, the Venn diagram for T p ^„ in Figure 3.6.1 shows that p => q when T p <zT q . 
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FIGURE 3.6.1 Venn diagram for T p _, q 
Example 3.6.7. 

(a) Over the natural numbers: n < 4 => n < 8 since {0, 1, 2, 3, 4) c {0, 1, 2, 3, 4, 5, 6, 7, 8). 

(b) Over the power set of the integers: | A c | =1 implies Afl|0, 1) * 0. 

(c) A c even integers => A f] odd integers =0. 

EXERCISES FOR SECTION 3.6 
A Exercises 

1. If £7 = P({1, 2, 3, 4}), what are the truth sets of the following propositions? 

(a) A n {2, 4} = 0. 

(b) 3 e A and 1 £ A. 

(c) A U (1) = A. 

(d) A is a proper subset of {2, 3, 4). 

(e) \A\ = |A e |. 

2. Over the universe of positive integers, define 

p{n) : n is prime and « < 32. 
q (n) : « is a power of 3 . 
r (n) : n is a divisor of 27. 

(a) What are the truth sets of these propositions? 

(b) Which of the three propositions implies one of the others? 

3. If U = (0, 1 , 2), how many propositions over U could you list without listing two that are equivalent? 

4. Given the propositions over the natural numbers: 

p : n < A 
q : In > 17 
r : n is a divisor of 1 8 
what are the truth sets of: 

(a) q 

(b) P Aq 

(c) r 

(d) q^r 

5. Suppose that s is a proposition over {1, ..., 8} . If T s = {1, 3, 5, 7), give two examples of propositions that are equivalent to s. 

6. (a) Determine the truth sets of the following propositions over the 
positive integers: 

p{n) : n is a perfect square and n < 100. 
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q(n) : n = \ V(A) | for some set A 
(b) Determine 7\,/\ 9 forp and q above. 

7. Let the universe be Z, the set of integers. Which of the following propositions are equivalent over - 
a: < n 1 < 9. 
b: < n 3 < 27. 
c: < n < 3. 
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I 3.7 Mathematical Induction 



In this section, we will examine mathematical induction, a technique for proving propositions over the positive integers. Mathematical (or 
finite) induction reduces the proof that all of the positive integers belong to a truth set to a finite number of steps . 

Mathematical Induction is sometimes called finite induction. 

Example 3.7.1. Consider the following proposition over the positive integers, which we will label p (n): The sum of the positive integers 

n (tt+ 1) 

from 1 to n is . This is a well-known formula that is quite simple to verify for a given value of n. For example, p(5) is: The sum of the 

positive integers from 1 to 5 is . Indeed, 1+2 + 3+4 + 5=15= — — . Unfortunately, this doesn't serve as a proof that p (n) is a 

tautology. All that we've established is that 5 is in the truth set of p. Since the positive integers are infinite, we certainly can't use this 
approach to prove the formula. 

An Analogy: Mathematical induction is often useful in overcoming a problem such as this one. A proof by mathematical induction is similar to 
knocking over a row of closely spaced dominos that are standing on end. To 

knock over the five dominos in Figure 3.7.1, all you need to do is push Domino 1 to the right. To be assured that they all will be knocked over, 
some work must be done ahead of time. The dominos must be positioned so that if any domino is pushed to the right, it will push the next 
domino in the line. 



FIGURE 3.7.1 Illustration of example 3.7.1 

Now imagine the propositions p(l), p (2), p(3), ... to be an infinite line of dominos. Let's see if these propositions are in the same formation 
as the dominos were. First, we will focus on one specific point of the line: p(99) and p(100). We are not going to prove that either of these 
propositions is true, just that the truth of p(99) implies the truth of p (100). In terms of our analogy, if p(99) is knocked over, it will knock over 
p(lOO). 

In proving p(99) => p(100), we will use p(99) as our premise. We must prove: The sum of the positive integers from 1 to 100 is . We 

start by observing that the sum of the positive integers from 1 to 100 is (1 + 2 + ••• + 99) + 100. That is, the sum of the positive integers 
from 1 to 100 equals the sum of the first ninety-nine plus the final number, 100. We can now apply our premise, p(99), to the sum 
1 + 2 + "• + 99. After rearranging our numbers, we obtain the desired expression for 1 + 2 + ••• + 100: 

1 + 2 + ••• + 99 + 100 =(1 + 2 + ••• + 99) + 100 

99(99+1) 



2 



+ 100 



99 100 2 > 

+ — 

2 

100 x 101 



2 
100(100+1) 



What we've just done is analogous to checking two dominos in a line and finding that they are properly positioned. Since we are dealing with an 
infinite line, we must check all pairs at once. This is accomplished by proving that p(n) => p(n + 1) for all n > 1: 

1 + 2 + ••• + n + (n + 1) = (1 + 2 + ••• + n) + (n + 1) 

= —^— + (n + 1) by p(n) 

n(n+l) 2(n+l) 



_ (n + l)(n+2) 

2 
_ (n+l)((n+l)+l) 
2 

They are all lined up! Now look at p(l): The sum of the positive integers from 1 to 1 is — . Clearly, p(\) is true. This sets off a chain reaction. 
Since p (1) => p (2), p (2) is true. Since p (2) => p (3), p(3) is true; and so on. ■ 

The Principle of Mathematical Induction. Let p(n) be a proposition 
over the positive integers, then p(n) is a tautology if 

(1) p{\) is true, and 

(2) for all n > 1, p(n) => p(n + 1). 



Note: The truth of p (1) is called the basis for the induction proof. The premise that p(n) is true in Statement (b) is called the induction hypothe- 
sis. The proof that p (n) implies p (n + 1) is called the induction step of the proof. Despite our analogy, the basis is usually done first in an 
induction proof. The order doesn't really matter. 

Example 3.7.2. Consider the implication over the positive integers 
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pin): q -> q u q x ->#,, ... , q„-\ ->q„, 9o => q„ 

A proof that p in) is a tautology follows . 

Basis: p(l) is qo -* qi, qo => <Ji . This is the logical law of detachment which we know is true. If you haven't done so yet, write out the truth 
table of Hqo -» q\ ) j\ qo) -» q\ to verify this step. 

Induction: Assume that p in) is true for some n > 1 . We want to prove that pin + 1) must be true. That is: 

9o -» 9i> 9i -»?2» ■■■ . 9«-i -»?b. 9« ->9n+l, tfo =* 9n + l 
Here is a direct proof of p in + 1): 
Steps Proposition(s) Justification 

(1) -(n + 1) go -» 91, ?i -»?2, ■■■ , 9«-i -*q n , 9o Premises 

(n + 2) q„ il) - in + 1), pin) 

in + 3) 9n -> 9n+i Premise 

(n + 4) 5„+i (n + 2), (n + 3), detachment ■ 

Example 3.7.3. For all n > 1 , n 3 + 1 n is a multiple of 3. An inductive proof follows: 
Basis: 1 3 + 2(1) = 3 is a multiple of 3. 
The basis is almost always this easy! 

Induction: Assume that n > 1 and n 3 + 2 n is a multiple of 3. Consider (n + l) 3 + 2 (n + 1). Is it a multiple of 3? 

(n + l) 3 + 2 (n + 1) = in 3 + 3 n 2 + 3 n + 1) + (2 n + 2) 

= n 3 + 2n + 3n 2 + 3n + 3 Rearrange the terms 

= in 3 + 2n) + 3(n 2 + n + 1) 

Yes, (n + l) 3 + 2 (n + 1) is the sum of two multiples of 3; therefore, it is also a multiple of 3. ■ 

Variations of Induction 

Now we will discuss some of the variations of the principle of mathematical induction. The first simply allows for universes that are similar to 
IP, like (-2, -1, 0, 1, ...)or{5, 6, 7, 8, ...). 

Principle of Mathematical Induction (Generalized). If p in) is a proposition over {k , k + 1, k + 2, ... \, where k is any integer, then 
p in) is a tautology if 

(1) pikg) is true, and 

(2) forall«>£ , pin) =» pin + 1). 

Example 3.7.4. In Chapter 2, we stated that the number of different permutations of k elements taken from an n element set, P in; k), can 
be computed with the formula — — . We can prove this statement by induction on n. For n > 0, let q in) be the proposition 

Pin: k) = - for all k from to n. 

(n-k)\ 

Basis: q (0) states that 

P(0; 0) = the number of ways that elements can be 

selected from the empty set and arranged in order 
= 0!/0! = 1. 

This is true — a general law in combinatorics is that there is exactly one way of doing nothing. 

Induction: Assume that q in) is true for some natural number n. It is left for us to prove that this assumption implies that qin + 1) is true. 
Suppose that we have a set of cardinality n+1 and want to select and arrange k of its elements. There are two cases to consider, the first of 
which is easy. If k = 0, then there is one way of selecting zero elements from the set; hence 

Pin + 1; 0) = 1 - ( " +1)! 



(n+1+0)! 

and the formula works in this case. 

The more challenging case is to verify the formula when k is positive and less than or equal to n + 1. Here we count the value of Pin + 1; k) 
by counting the number of ways that the first element in the arrangement can be filled and then counting the number of ways that the 
remaining k — 1 elements can be filled in using the induction hypothesis . 

There are n+1 possible choices for the first element. Since that leaves n elements to fill in the remaining k - 1 positions, there are 
Pin; k - 1) ways of completing the arrangement. By the rule of products, 
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P(n +l;k) = (n+ l)P(n;k 
= (n + 1) 



(n+l)n\ 



(n-k+iy. 



A second variation allows for the expansion of the induction hypothesis. The course-of-values principle includes the previous generalization. It 
is also sometimes called strong induction. 

The Course-of- Values Principle of Mathematical Induction. If p(ri) is a proposition over {k , k + 1, k + 2, ... }, where k is any 
integer, then p (n) is a tautology if 

(1) p(^o) is true , an d 

(2) for all n > k , p(k ), p(k + 1), p(n) => p(n + 1). 

Example 3.7.5. A prime number is defined as a positive integer that has exactly two positive divisors, 1 and itself. There are an infinite 
number of primes. The list of primes starts with 2, 3, 5, 7, 11,... . The proposition over (2, 3, 4, . . .) that we will prove here is p (n): n can 
be written as the product of one or more primes. In most texts, the assertion that p (n) is a tautology would appear as: 

Theorem. Every positive integer greater than or equal to 2 has a prime decomposition. 

If you were to encounter this theorem outside the context of a discussion of mathematical induction, it might not be obvious that the proof can 
be done by induction. Recognizing when an induction proof is appropriate is mostly a matter of experience. Now on to the proof! 

Basis: Since 2 is a prime, it is already decomposed into primes (one of them). 

Induction: Suppose that for some k > 2 all of the integers 2,3, . . . , k have a prime decomposition. Notice the course-of-value hypothesis. 
Consider k + 1 . Either k + 1 is prime or it isn't. If A: + 1 is prime, it is already decomposed into primes. If not, then k + 1 has a divisor, d, 
other than 1 and k + 1. Hence, k + 1 = cd where both c and d are between 2 and k. By the induction hypothesis, c and d have prime 
decompositions, c t c^ ■■■ c„, and d\ d^ ■■■ d„, .respectively. Therefore, k + 1 has the prime decomposition C] C2 •■■ c,„ d\ d^ ■■■ d,„. ■ 

HISTORICAL NOTE 

Mathematical induction originated in the late nineteenth century. Two mathematicians who were prominent in its development were Richard 
Dedekind and Giuseppe Peano. Dedekind developed a set of axioms that describe the positive integers. Peano refined these axioms and gave a 
logical interpretation to them. The axioms are usually called the Peano Postulates. 

Peano's Postulates. The system of positive integers consists of a nonempty set, P; a least element of P, denoted 1; and a "successor function," 
s , with the properties 

(1) If k e P , then there is an element of P called the successor of k, denoted s {k). 

(2) No two elements of P have the same successor. 

(3) No element of P has 1 as its successor. 

(4) If S c P, 1 e S, and k e S => s (k) e 5, then S = P. 




Richard Dedekind Giuseppe Peano 

Notes: 

(a) You might recognize s (k) as simply being k + 1 . 

(b) Axiom 4, mentioned above, is the one that makes mathematical induction possible. In an induction proof, we simply apply that axiom to the 
truth set of a proposition. 

Exercises for Section 3.7 
A Exercises 

1 . Prove that the sum of the first n odd integers equals n 2 . 

2. Prove that if n > 1, then 1(1!) + 2(2!) + ••■ + n(n\) = (n + 1)! - 1. 
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3. Prove that for n > 1: £r= - n(n + 1) (In + 1). 

*=i 6 

4. Prove that for n > 1: £ 2* = 2" +1 - 1. 

k=0 

5 . Use mathematical induction to show that for n > 1 , 

l l t l _ _«_ 

1x2 2x3 n(n+[) n+\ 

6. Prove that if n > 2, the generalized DeMorgan's Law is true: 

-(Pi Ap 2 A- Ap„)<=>(-pi)V (-P2) V ■■■ V(-p„) 
B Exercises 

7. The number of strings of n zeros and ones that contain an even number of ones is 2" _1 . Prove this fact by induction for n > 1 . 

8 . Let p (n) be 8" - 3" is a multiple of 5 . Prove that p (n) is a tautology over N . 

9. Suppose that there are n people in a room, n > 1, and that they all shake hands with one another. Prove that — — handshakes will have 
occurred. 

10. Prove that it is possible to make up any postage of eight cents or more using only three- and five-cent stamps. 

C Exercises 

11. Generalized associativity. It is well known that if a\, «2. and a-} are numbers, then no matter what order the sums in the expression 
Oi + 02 + a 3 are taken in, the result is always the same. Call this fact p(3) and assume it is true. Prove using course-of-values induction that if 
a\, d2, ..., and a n are numbers, then no matter what order the sums in the expression a\ + ai_ + ••• + a n are taken in, the result is always the 
same. 

12. Let S be the set of all numbers that can be produced by applying any of the rules below in any order a finite number of times. 

Rule 1: - e S 

2 

Rule 2: 1 e S 

Rule 3: If a and b have been produced by the rules, then ab e S. 

Rule 4: If a and b have been produced by the rules, then — e S. 

Prove by course-of-values induction that agX =) 0<d< 1. Hint: The number of times the rules are applied should be the integer that you do 
the induction on. 

13. A recursive definition is similar to an inductive proof. It consists of a basis, usually the simple part of the definition, and the recursion, 
which defines complex objects in terms of simpler ones. For example, if x is a real number and n is a positive integer, we can define x" as 
follows : 

Basis: x l = x . 

Recursion: if n > 2,x" = x" _1 x . 

For example, x 3 = x 2 x = (x [ x) x = (xx)x. Proofs involving objects that are defined recursively are often inductive. Prove that if n, m e P, 
x m+ " = x m x". Hint: Let p (m) be the proposition that x m+n = x m x" for all n > 1 . There is much more on recursion in Chapter 8. 

14. Let 5 be a finite set and let P„„ be defined recursively by Pi = S and P n = 5xP„_i for n > 2. 

(a) List the elements of P3 for the case S = {a, b}. 

(b) Determine the formula for \P n |, given that \S \ =k, and prove your formula by induction. 
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I 3.8 Quantifiers 



As we saw in Section 3.6, if p (n) is a proposition over a universe U, its truth set T p is equal to a subset of U. In many cases, such as when p (n) 
is an equation, we are most concerned with whether T„ is empty or not. In other cases, we might be interested in whether T p = U; that is, 
whether p (n) is a tautology. Since the conditions T p =£ and T p = U are so often an issue, we have a special system of notation for them. 

THE EXISTENTIAL QUANTIFIER 

If p(n) is a proposition over U with T p t 0, we commonly say "There exists an n in U such that p(n) (is true)." We abbreviate this with the 
symbols (3 n)u (p(n)). The symbol 3 is called the existential quantifier. If the context is clear, the mention of U is dropped: (3 n) (p(n)). 
Example 3.8.1. 

(a) (3 k) z (k 2 - k - 12 = 0) is another way of saying that there is an integer that solves the equation k 2 - k - 12 = 0. The fact that two such 
integers exist doesn't affect the truth of this proposition in any way. 

(b) (3k) z (3k = 102) simply states that 102 is a multiple of 3, which is true. On the other hand, (3 k) z (3k= 100) states that 100 is a 
multiple of 3, which is false. 

(c) (3 x) K (x 2 + 1 = 0) is false since the solution set of the equation x 2 + 1 = in the real numbers is empty. It is common to write 
x) R (x 2 + 1 = 0) in this case. 

There are a wide variety of ways that you can write a proposition with an existential quantifier. Table 3.8.1 contains a list of different variations 
that could be used for both the existential and universal quantifiers . 

THE UNIVERSAL QUANTIFIER 

If p (n) is a proposition over U with T p = U, we commonly say "For all n in U, p(n) (is true)." We abbreviate this with the symbols 
(V ri)u (p(n)). The symbol V is called the universal quantifier. If the context is clear, the mention of U is dropped: (V n) (p(n)). 
Example 3.8.2. 

(a) We can say that the square of every real number is non-negative symbolically with a universal quantifier: (V x) R (x 2 > 0). 

(b) (V n) z (n + = + n = n) says that the sum of zero and any integer n is n. This fact is called the identity property of zero for addition. 

Table 3.8.1 Notational Variations for Existential and Universal Quantifiers 

Universal Quantifier Existential Quantifier 

(Vn) v (j>m (3n) u (p(n)) 

(V»£U)««» (3neU)(j>m 

V/ie(/, p(n) 3 n e U such that p(n) 

p(n), V n e U p (n) is true for some n e U 

p (n) is true for all n e U 

THE NEGATION OF QUANTIFIED PROPOSITIONS 

When you negate a quantified proposition, the existential and universal quantifiers complement one another. 

Example 3.8.3. Over the universe of animals, define F(x) : x is a fish and W(x) : x lives in the water. We know that the proposition W(x) -» F(x) 
is not always true. In other words, (Vx)(W(x) -» F(x)) is false. Another way of stating this fact is that there exists an animal that lives in the 
water and is not a fish; that is, 

-n (V x) (W(x) -> F(x)) <=> (3 x) (-i (W(x) -> F(x))) . 
<=> (3x)(W(x) A ->F(jc)) 

Note that the negation of a universally quantified proposition is an existentially quantified proposition. In addition, when you negate an 
existentially quantified proposition, you obtain a universally quantified proposition. Symbolically, 

- ((V n) v {p{n)) )«(3n)„(n p{n))), and 
-,((3n)„(pW))« (Vn)y(-p(n))) 
Example 3.8.4. 

(a) The ancient Greeks first discovered that y2 is an irrational number; that is, y2 is not a rational number, -i ((3 r)q (r 2 = 2)) and 
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(V r)<Q (r 2 * 2) both state this fact symbolically. 

(b) -i ((V n) P (n 2 - n + 41 is prime)) is equivalent to (3 n) P (n 2 - n + 41 is composite). They are both either true or false. 

MULTIPLE QUANTIFIERS 

If a proposition has more than one variable, then you can quantify it more than once. For example, if p (x, y) : x 2 - y 2 = (x + y) (x — y) is a 
tautology over the set of all pairs of real numbers because it is true for each pair (x, y) in IR xR. Another way to look at this proposition is as a 
proposition with two variables. The assertion that p (x, y) is a tautology could be quantified as (V x) R ((V y) R (p (x, y))) or 

(va«vi) r wi, m 

In general, multiple universal quantifiers can be arranged in any order without logically changing the meaning of the resulting proposition. The 
same is true for multiple existential quantifiers. For example, p(x, y) : x + y = 4andx - y — 2 is a proposition over R x IR. 
(3 x) K ((3 y) R (x + y = 4 and x - y = 2)) and (3 y) K ((3 x) R (x + y = 4 and x - y = 2)) are equivalent. A proposition with multiple 
existential quantifiers such as this one says that there are simultaneous values for the quantified variables that make the proposition true. A 
similar example is q (x, y) :2x - y - 2 and 4 x - 2y = 5, which is always false; and the following are all equivalent 

- ((3 x) R ((3 y) R (q(x, y)))) » -■ (3 y) R ((3 *) R (<?(*, y)))) 

«(V)') R (-(0 4»,)'))) 
<=>((Vy) R ((V*) R (-.<?(*, y)))) 
<=>((V*) R ((Vy) R (-<?(*, y)))) 

When existential and universal quantifiers are mixed, the order cannot be exchanged without possibly changing the meaning of the proposition. 
For example, let R + be the positive real numbers, x : (V a) R + ((3 fo) R + (ab = 1)) and y : (3 b) R * ((V a) R + (ab = 1)) have different meanings; x 
is true, while y is false. 

TIPS ON READING MULTIPLY QUANTIFIED PROPOSITIONS 

It is understandable that you would find propositions such as x difficult to read. The trick to deciphering these expressions is to "peel" one 
quantifier off the proposition just as you would peel off the layers of an onion (but quantifiers shouldn't make you cry) . Since the outermost 
quantifier in x is universal, x says that z (a) : (3 fc) R + (ab = 1) is true for each value that a can take on. Now take the time to select a value for 
a, like 6. For the value that we selected, we get z(6) : (3 b) R + (6 b = 1), which is obviously true since 6 b = 1 has a solution in the positive real 
numbers. We will get that same truth value no matter which positive real number we choose for a; therefore, z (a) is a tautology over IR + and we 
are justified in saying that x is true. The key to understanding propositions like x on your own is to experiment with actual values for the 
outermost variables as we did above. 

Now consider y. To see that y is false, we peel off the outer quantifier. Since it is an existential quantifier, all that y says is that some positive 
real number makes w(b) : (V a) R + (a b = 1) true. Choose a few values of b to see if you can find one that makes w (b) true. For example, if we 
pick b = 2, we get (V a) R + (2 a = 1), which is false, since 2 a is almost always different from 1 . You should be able to convince yourself that 
no value of b will make w (b) true. Therefore, y is false. 
Another way of convincing yourself that y is false is to convince yourself that -i y is true: 

- ((3 b\+ ((V a) R+ (a b = 1))) » (V b\, -, ((V a) R+ (ab= 1)) 

»(Vi) r (0 fl ) r ( a fc * 1)) 

In words, for each value of b, a value for a that makes a b * 1 . One such value is a = - + 1 . Therefore, -i y is true. 

EXERCISES FOR SECTION 3.8 
A Exercises 

1 . Let C (x) be "x is cold-blooded," let F (x) be "x is a fish," and let 5 (x) be "x lives in the sea." 

(a) Translate into a formula: Every fish is cold-blooded. 

(b) Translate into English: (3 x) (S (x) f\ ->F (x)) 
and (V x) (F (x) -» SO)). 

2. Let M (x) be "x is a mammal," let A (x) be "x is an animal," and let W (x) be "x is warm-blooded." 

(a) Translate into a formula: Every mammal is warm-blooded. 

(b) Translate into English: (3 x) (A (x) /\(-<M (x))). 

3. Over the universe of books, define the propositions B(x): x has a blue cover, M(x): x is a mathematics book, C(x): x is published in the 
United States, and R (x, y) : The bibliography of x includes y. Translate into words: 

(a) (3jc)(-.B(jc)). 

(b)(Vi)(M(i) AU(x)-> B(x)). 

(c) (3x)(M(x) A -■«(*)). 
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(d)(3y)((Vx)(M(x)^R(x,y))). 
Express using quantifiers: 

(e) Every book with a blue cover is a mathematics book. 

(f) There are mathematics books that are published outside the United States. 

(g) Not all books have bibliographies. 

revised 

4. Let the universe of discourse, V, be the set of all people, and let M(x, y) be "x is the mother of v." 

(a) Which of the following is a true statement? Translate it into English, 
(i) (3x) u ((Vy) u (M(x,y))) 

(ii) Ciy) u ((3x) u (M(x,y))) 

(b) Translate the following statement into logical notation using quantifiers and the proposition M(x, y) over U: "Everyone has a grandmother," 

5. Translate into your own words and indicate whether it is true or false that (3 a) z (4 u 2 - 9 = 0). 

6. Use quantifiers to say that v3 is an irrational number. 

7. What do the following propositions say, where U is the power set of { 1 , 2, ... , 9)? Which of these propositions are true? 

(a)(VA)t;(|A| * |A C |). 

(b) {3A) V {3B) V {\A\ =5, |B| =5, andAnfi = 0) 

(c) {MA) V {VB) V {A-B = &-A<) 

8. Use quantifiers to state that for every positive integer, there is a larger positive integer. 

9. Use quantifiers to state that the sum of any two rational numbers is rational. 

10. Over the universe of real numbers, use quantifiers to say that the equation a + x = b has a solution for all values of a and b. Hint: You will 
need three quantifiers . 

1 1 . Let n be a positive integer. Describe using quantifiers: 

n 

(a) x e IJ A k 

(b) xe f]A k 

k=l 

12. Prove that (3 x) (V y) (p (x, y)) => (V y) (3 x) (p (x, v)), but the converse is not true. 
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I 3.9 A Review of Methods of Proof 



One of the major goals of this chapter is to acquaint the reader with the key concepts in the nature of proof in logic , which of course carries over 
into all areas of mathematics and its applications. In this section we will stop, reflect, and "smell the roses," so that these key ideas are not lost 
in the many concepts covered in logic . In Chapter 4 we will use set theory as a vehicle for further practice and insights into methods of proof. 

KEY CONCEPTS IN PROOF 

1 . All theorems in mathematics can be expressed in "If P then C" (P => C) format, or in "C\ if and only if C2" (P <s=> C) format. The latter is 
equivalent to "If Ci then C2 , and if C2 then C\ ." Alternate ways of expressing conditional propositions are found in Section 3.1. 

2. In "If P then C," P is the premise (or hypothesis) and C is the conclusion. It is important to realize that a theorem makes a statement that is 
dependent on the premise being true. 

3 . There are two basic methods for proving P => C: 

(a) Direct: Assume P is true and prove C is true; and 

(b) Indirect (proof by contradiction): Assume P is true and C is false and prove that this leads to a contradiction of some premise, theorem, 
or basic concept. 

4. The method of proof for "If and only if" (iff) theorems is found in the law (P <-> C) <=> {{P -» C) A (C -» P)). Hence to prove an "If and 
only if" statement one must prove an "if . . . then ..." statement and its converse. 

The initial response of most people when confronted with the task of being told they must be able to read and do proofs is: 

(a) Why? or, 

(b) I cannot do proofs. 

To answer the first question, problem solving, even on the most trivial level, involves being able to read statements. First we must understand 
the problem and know the hypothesis; second, we must realize when we are done and we must understand the conclusion. To apply theorems or 
algorithms we must be able to read theorems and their proofs intelligently. 

To be able to do the actual proofs of theorems we are forced to learn: 

(1) the actual meaning of the theorems, and 

(2) the basic definitions and concepts of the topic discussed. 

For example, when we discuss rational numbers and refer to a number x as being rational, this means we can substitute a fraction - in place of 

x, with the understanding that p and q are integers and q±0. Therefore, to prove a theorem about rational numbers it is absolutely necessary 
that you know what a rational number "looks like." 

It's easy to comment on the response, "I cannot do proofs." Have you tried? As elementary school students we were in awe of anyone who 
could handle algebraic expressions, especially complicated ones. We learned by trying and applying ourselves. Maybe we cannot solve all 
problems in algebra or calculus, but we are comfortable enough with these subjects to know that we can solve many and can express ourselves 
intelligently in these areas . The same remarks hold true for proofs . 

THE ART OF PROVING P => C 

First one must completely realize what is given, the hypothesis. The importance of this is usually overlooked by beginners. It makes sense, 
whenever you begin any task, to spend considerable time thinking about the tools at your disposal. Write down the premise in precise language. 
Similarly, you have to know when the task is finished. Write down the conclusion in precise language. Then you usually start with P and 
attempt to show that C follows logically. How do you begin? Basically you attack the proof the same way you solve a complicated equation in 
elementary algebra. You may not know exactly what each and every step is but you must try something. If we are lucky, C follows naturally; if 
it doesn't, try something else. Often what is helpful is to work backward from C. Finally, we have all learned, possibly the hard way, that 
mathematics is a participating sport, not a spectator sport. One learns proofs by doing them, not by watching others do them. We give several 
illustrations of how to set up the proofs of several examples. Our aim here is not to prove the statements given, but to concentrate on the logical 
procedure. 

Example 3.9.1. We will outline a proof that the sum of any two odd integers is even. Our first step will be to write the theorem in the 
familiar conditional form: If j and k are odd integers, then j + k is even. The premise and conclusion of this theorem should be clear now. 
Notice that if j and k are not both odd, then the conclusion may or may not be true. Our only objective is to show that the truth of the premise 
forces the conclusion to be true. Therefore, we can express the integers j and k in the form that all integers take; that is: 

n e Z is odd implies (3meZ)(n = 2m + 1). 
This observation allows us to examine the sum y + k and to verify that it must be even. 

Example 3.9.2. Let n e Z. We will outline a proof that n 1 is even if and only if n is even. 
Outline of a proof: Since this is an "If and only if theorem we must prove two facts (see key concept number 4 above): 
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I. (=>) If n 2 is even, then n is even. To do this directly, assume that n 2 is even and prove that n is even. To do this indirectly, assume n 2 is 
even and that n is odd, and reach a contradiction. It turns out that the latter of the two approaches is easiest here. 

II. (<=) If n is even, then n 2 is even. To do this directly, assume that n is even and prove that n 2 is even. 

Now that we have broken the theorem down into two parts and know what to prove, we proceed to prove the two implications. The final 
ingredient that we need is a convenient way of describing even integers. When we refer to an integer n (or m, or k,. . . ) as even, we can 
always replace it with a product of the form 2 q, where q is an integer (more precisely, (3q) z (« = 2 q)). In other words, for an integer to be 
even it must have a factor of two in its prime decomposition. 

Example 3.9.3. Our final example will be an outline of the proof that the square root of 2 is irrational (not an element of Q). This is an 
example of the theorem that does not appear to be in the standard P => C form. One way to rephrase the theorem is: If x is a rational number, 
then x 2 ± 2. A direct proof of this theorem would require that we verify that the square of every rational number is not equal to 2. There is no 
convenient way of doing this , so we must turn to the indirect method of proof. In such a proof, we assume that x is a rational number and that 

x 2 = 2 (i.e., V 2 is a rational number). This will lead to a contradiction. In order to reach this contradiction, we need to use the following facts: 

(a) A rational number is a quotient of two integers . 

(b) Every fraction can be reduced to lowest terms , so that the numerator and denominator have no common factor greater than 1 . 

(c) If n is an integer, n 2 is even if and only if n is even. 

EXERCISES FOR SECTION 3.9 
B Exercises 

1 . Prove that the sum of two odd positive integers is even. 

2. Write out a complete proof that if n is an integer, rr is even if and only if n is even. 

3. Write out a complete proof that v2 is irrational. 

4. Prove that V 2 is an irrational number. 

5 . Prove that if x and y are real numbers such that x + y < 1 , then either x < - or y < - . 

6. Use the following definition of absolute value to prove the given statements: If x is a real number, then the absolute value of x, | x |, is 
defined by: 

x if x > 
-x if x < 

(a) For any real number x, \x\ > 0. Moreover, | x \ = implies x = 0. 

(b) For any two real numbers x and y, \x\- \y\ = \xy\. 

(c) For any two real numbers x and y, \ x + y \ < \ x \ + \ y \ . 
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| SUPPLEMENTARY EXERCISES FOR CHAPTER 3 



Section 3.1 

1 . Construct the truth tables of 

(a) p V P 

(b) p A (-. p) 

(c) p V (-■ p) 

(d) P A P 

2. Express each of the following in symbolic form and determine whether they are true or false: 

(a) If a, b e Z, and if a = or b = 0, then a ■ b = 0. 

(b) If a, 6 e Z,and if a • fo = 0, then a = or fo = 0. 

(c) Let a, b e Z.. a ■ b = if and only if a = or b = 0. 

(d) If {5)c Z.then 2 + 3 = 8. 

(e) If 2 + 3 = 8, then the world is flat. 

(f) 5 is an odd integer if and only if 8 is an even integer. 

Section 3.2 

3. Write the truth table for the expression p V 9 A ""' P- 

4. Insert parentheses in the following statements to indicate the order in which the operations are performed: 

(a)pV?Ai'V-? 
(b)pA-9V-pA9 

(c) p V q A r 

(d) pl\q\l P Nr\] ql\r 

Section 3.3 

5. Use truth tables to verify that ((p -» -i q) /\ (q V r ) A ("""")) =* -■ p is a tautology. 

6. Is an implication equivalent to its converse? Verify your answer using a truth table. 

7. Prove that an implication is always equivalent to its contrapositive. 

8. (a) Construct truth tables for the following propositions generated by p, q, and r. 

(i) r A (P A q) (ii) r V (p V <7) (iii) r A 9 
(b) Which of the propositions i, ii, and iii in part (a) imply proposition i? Explain. 

9. Suppose that x is a proposition generated by p and q, and x is equivalent to p -> p A q- What is the truth table for x? 

10. The Scheffer Stroke is the logical operator defined by the following truth table: 

P q Plq 

1 

1 1 

1 1 
1 1 

Truth Table for the Sheffer Stroke 

(a) Prove that p | q is equivalent to -i (p A <?)■ 

The significance of the Sheffer Stroke is that it is a "universal" operator. All other operators can be built from it. 

(b) Prove that -i p <=> p | p. 

(c) Build A using only the Sheffer Stroke. 
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(d) Build V using only the Sheffer Stroke. 

Section 3.4 

1 1 . Write the negation of each of the following statements: 

(a) 3 is a prime number and it is even. 

(b) 4 is a prime number or it is odd. 

(c) If I can exhibit an example of a statement then I have proven it true. 

(d) Ifx 2 - 11 x + 12 = 0,thenx = 3orx = 8. 

12. Prove that all A defined by has the same property as the Sheffere Stroke (see problem 10) in that is is a "universal" operator. 

P 1 pl\q 



1 1 

1 1 

1 1 1 

Truth Table for p J\ q 

13. The following are frequently used and very important tautologies in logic. Use truth tables to prove them. 

(a) (p -> q) ** (-> q -> -. p) 

(b) (p^q)d((p^q)A(q^p)) 

14. Write the following in symbolic notation and prove it is a tautology: "The statement p if and only if q is equivalent to saying if p then q 
and if q then p." 

15 . Write the following in symbolic notation and determine whether it is a valid argument: "If I quit my job, then I will starve. If I don't do my 
work, then I must quit my job. I did my work, therefore I will eat." 

16. Write the dual of each of the following statements: 

(a) hpV0)^l 

(b) (pV<?)A hpVr)=»frV«) 

Section 3.5 

17. Write the following in symbolic form and then determine its validity. "If this car is made in England then parts are difficult to obtain. This 
car is expensive, or it is not difficult to obtain parts. But this car is not expensive. Hence it was not made in England." 

18. In order to attach the Mark 13 printer to the Lemon III computer, you must set eight "dip switches" in the computer according to the 
following rules. The switches are labeled a through h and are set to be either ON or OFF. 

(1) Neither a nor c is set the same as d. 

(2) b and g are different if and only if e and g are in the same positions . 

(3) g is OFF if d is OFF, but g is ON if b is OFF. 

(4) d is ON, unless e is the same as/. 

(5) h is not the same as a if either b or e is OFF. 

(6) g is OFF only if e is not the same as h. 

(7) b,f, and g are not all the same. 
How should the switches be set? 

19. Consider the following argument: 

If person X does not live in France, then X does not speak French. 

X does not drive a Chevrolet. 

If X lives in France, then he rides a bicycle. 

Either X speaks French or he drives a Chevrolet. 

Therefore, X rides a bicycle. 
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Let/? = "X lives in France." 

q = "X speaks French." 

r = "X drives a Chevrolet." 

s = "X rides a bicycle." 
Translate the argument into logical notation using these propositions, and prove it by any method except a truth table. 

20. Determine the validity of the following argument: "I will miss class only if I sleep late. I will not sleep late. Therefore, I will not miss class." 

21 . Mayoral candidate Ms. Perpetual Candidate made the following promise to the voters: "If I am elected, I will bring industry to the town. If I 
bring industry to the town, your taxes will decrease. If your taxes decrease, you will be wealthier. Therefore, if I am elected, you will be 
wealthier." Express this argument in symbolic notation and determine whether the mayoral candidate is telling the truth. 

22. Professor Smoothtalker made the following promise to his class. "If you receive an A in this course, you are happy. You will do all your 
assignments or you are not happy. If you concentrate too hard, you will not do all your assignments. Therefore, if you are happy, do not 
concentrate too hard." Is Professor Smoothtalker' s argument valid? 

23. Determine whether the following argument is valid: Taxes will increase or government spending decreases. Government spending increases 
or more people have jobs. More people do not have jobs or people are rich. Therefore, if taxes decrease, people are rich. 

Section 3.6 

24. Let p (n) be n < 2 and let q (n) be n 2 < 5. 

(a) Over the universe of integers, Z, are p and q equivalent? Does one imply the other? 

(b) Over the universe of natural numbers , fsl , are p and q equivalent? Does one imply the other? 

25. Prove that: T pAq = T p f] T q . 

26. Prove that: T p ^ q = T p c U T q . 

Section 3.7 

27. Express 60 and 120 as a product of primes. 

28. Prove that for n > 1 



Z< 3 = V(» + i) 2 = (%(» + i)) 

29. (a) Prove that (," ] ) + (?) = ("t I for £ > 1 and n > k + 1 . 
(b) Use mathematical induction to prove the binomial theorem: 

(x + y)" = V ( " ) x"- k y k for n > 0. 

30. Use mathematical induction to prove for all n > 1 and for all real numbers c, a t , and b t ,i = 1 , 2, . . . , n: 

(a) Z(a i + b i )= i>,+ £fo, 

1=1 (=1 (=1 

n n 

(b) £ ccij = c 2 «, 

Section 3.8 

31. Write the negation of: "Some sailing is dangerous and all fishing is tedious" in graceful English. 

32. Prove: - ((3 u) (V v) (p)) » (V u) ((3 v) (-. p)) 

33. Translate the following sentences into expressions using quantifiers: 

(a) All fish except sharks are kind to children. 

(b) Either every wine-drinker is very communicative, or some pawn broker is honest and doesn't drink wine. 

(c) If all clever philosophers are cynics, and only women are clever philosophers, then, if there are any clever philosophers, some 
women are cynics . 

34. First write each of the following in logical notation using quantifiers, then write the negative of each logical expression using symbols and 
complete English sentences. If possible, determine which of the statements are true. 
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(a) All people in this classroom are over six feet tall. 

(b) Some of the people in this classroom are over six feet tall and are bald. 

(c) Let U = {2, 3, 4, 5) and let p (n) denote the statement "n is a prime number." Apply the above directions to the statement: Every 
number in U is a prime number. 

(d) All prime numbers are odd. 

(e) You will all pass the course Discrete Structures or you will all fail. 

(f) You can fool some of the people some of the time. 

35. Use quantifiers to state that for every positive integer, there is a larger positive integer. 

36. Over the universe of students in your class, let N (x, y) be "x knows y's name." Interpret in English: 

(a) (3x)((Vy)(N(x, y))) 

(b) (VyKO *)(#(*, y)andx* y)) 

(c) How would you symbolically say that everyone knows everyone's name? 

(d) How would you symbolically say that everyone knows your name? 

(e) How would you symbolically say that someone in the class has amnesia? 

Section 3.9 

37. For any nonzero real number x, 

x 2 = 1 «=> x=lorx = -l 

(a) Outline the logical procedure you would use to prove this statement. 

(b) Fill in the proof. 

38. Let a, b, c e IP and read a | b as "a divides evenly into b." Consider the statements 

(i) b \ a and a | b implies a = b. 

(ii) If p is prime and p | a 2 , then p \ a. 

(a) Are these statements true? Explain your answers . 

(b) Is the converse of each of these statements true? Explain your answers . 

(c) Is the contrapositive of each of these statements true? Explain your answers . 

39. Let a, b e R. A necessary and sufficient condition for a-b = is that a = or b = 0. 

(a) Is this statement true? Explain your answer. 

(b) Outline the logical procedure you would use to prove this statement. 
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Chapter 4 




MORE ON SETS 



GOALS 

In this chapter we shall look more closely at some basic facts about sets. One question we could ask ourselves is: Can we manipulate sets 
similarly to the way we manipulated expressions in basic algebra, or to the way we manipulated propositions in logic? In basic algebra we are 
aware that a ■ (b + c) = a- b + a ■ c for all real numbers a, b, and c. In logic we verified an analogue of this statement, namely, 
P A ( a V '') <=> (p A <?) V (P A '')). where p, q, and;' were arbitrary propositions. If A, B, and C are arbitrary sets, is 
A Pi (B U C) = (A H B) IJ (A H C)l How do we convince ourselves of its truth or falsity? Let us consider some approaches to this 
problem, look at their pros and cons, and determine their validity. Many of the ideas expressed are true, in general, in mathematics. Partitions of 
sets and min sets will be introduced. 



I 4.1 Methods of Proof for Sets 



There are a variety of ways that we could attempt to prove that the distributive law for intersection over union is true; that is, that for any three 
sets, A, B, and C, A f] (B IJ C) = (A f] B) U (A f] C). We start with a common "nonproof" and then work toward more acceptable 
methods . 

EXAMPLES AND COUNTEREXAMPLES 

We could, for example, let A = {1, 2), B = {5, 8, 10), and C = {3, 2, 5), and determine whether the distributive law is true. Obviously, in 
doing this we will have only determined that the distributive law is true for this one example. It does not prove the distributive law for all 
possible sets A, B, and C and hence is an invalid method of proof. However, trying a few examples has considerable merit insofar as it makes us 
more comfortable with the statement in question, and indeed if the statement is not true for the example, we have disproved the statement. 

Definition: Counterexample. An example that disproves a statement is called a counterexample . 

Example 4.1.1. From basic algebra we learned that multiplication is distributive over addition. Is addition distributive over multiplica- 
tion; that is, is a + (b ■ c) = (a + b) ■ {a + c)l If we choose the values a = 3, b = 4, and c = 1, we find that 
3 + (4 • 1) i= (3 + 4) -(3 + 1). Therefore, this set of values serves as a counterexample to a distributive law of addition over multiplication. 

PROOF USING VENN DIAGRAMS 

In this method, we illustrate both sides of the statement via a Venn diagram and determine whether both Venn diagrams give us the same 
"picture," For example, the left side of the distributive law is developed in Figure 4.1.1 and the right side in Figure 4.1.2. Note that the final 
results give you the same shaded area. 



Applied Discrete Structures by A. Doerr & K. Levasseur is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3 .0 United States License. 



79 



Chapter 4 - More on Sets 



79 




V 




|B 


■ilifel 




UPc 





"> 




Figure 4.1 .1 Left side of distributive law developed 

The advantage of this method is that it is relatively quick and mechanical. The disadvantage is that it is workable only if there are a small 
number of sets under consideration. In addition, it doesn't work very well in a static environment like a book or test paper. Venn diagrams tend 
to work well if you have a potentially dynamic environment like a blackboard or video. 




u > 




Figure 4.1.2 Right side of distributive law developed 

PROOF USING SET-MEMBERSHIP TABLES 

Let A be a subset of a universal set U and let we U. To use this method we note that exactly one of the following is true: u E A or u £ A, 
Denote the situation where u e A by 1 and that where u £ A by 0. Working with two sets, A and B, and if u e U, there are four possible 
outcomes of "where u can be." What are they? The set-membership table for A \J B is : 

A B AIJB 



1 1 

1 1 
1 1 1 

This table illustrates that u e A IJ 6 if and only if a e A or u e B. 

In order to prove the distributive law via a set-membership table, write out the table for each side of the set statement to be proved and note that 
if S and T are two columns in a table, then the set statement S is equal to the set statement T if and only if corresponding entries in each row are 
the same. 

To prove AH(B U C) = (AHB) U (A RC), first note that the statement involves three sets, A, B, and C, So there are 2 3 = 8 possibilities for the 
membership of an element in the sets . 
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Since each entry in Column 7 is the same as the corresponding entry in Column 8, we have shown that A \J (B \J C) = (A f] B) \J (A f] C) 
for any sets A, B, and C. The main advantage of this method is that it is mechanical. The main disadvantage is that it is reasonable to use only 

for a relatively small number of sets. If we are trying to prove a statement involving five sets, there are 2 = 32 rows, which would test anyone's 
patience doing the work by hand. 

PROOF USING DEFINITIONS 

This method involves using definitions and basic concepts to prove the given statement. This procedure forces one to learn, relearn, and 
understand basic definitions and concepts . It helps individuals to focus their attention on the main ideas of each topic and therefore is the most 
useful method of proof. One does not learn a topic by memorizing or occasionally glancing at core topics, but by using them in a variety of 
contexts. The word proof panics most people; however, everyone can become comfortable with proofs. Do not expect to prove every statement 
immediately. In fact, it is not our purpose to prove every theorem or fact encountered, only those that illustrate methods and/or basic concepts. 
Throughout the text we will focus in on main techniques of proofs. Let's illustrate by proving the distributive law. 

Proof Technique 1. State or restate the theorem so you understand what is given (the hypothesis) and what you are trying to prove (the 
conclusion). 

Theorem 4.1.1. If A, B, and C are sets, then A fl (B U Q = (A f] B) U (A f] C) 

Assume: A, B , and C are sets. 

Prove: A f] (B (J C) = (A f] B) U (A f] C). 

Commentary: What am I trying to prove? What types of objects am I working with: sets? real numbers? propositions? The answer is sets: sets 
of elements that can be anything you care to imagine. The universe from which we draw our elements plays no part in the proof of this theorem. 

We need to show that the two sets are equal. Let's call them the left-hand set (L.H.S.) and the right-hand set (R.H.S. ) To prove that 
L.H.S. = R.H.S., we must prove two things: (a) L.H.S. c RMS. and (b) RMS. c LMS. 

To prove part a and, similarly, part b, we must show that each element of L.H.S. is an element of R.H.S. Once we have diagnosed the problem 
we are ready to begin. 

Proof of Theorem 4.1.1: We must prove: 

(a) A fl (B U C) c (A fl B) \J (A f] C). 

Let x e A fl (B U Q to show x e (A fl B) U (A fl C). 

x e A fl (B U Q 

Definition of U . f] => x <= A and (x e B or x e C) 

Distributive Law of Logic => (x e A and x e B) or (x e A and ieQ 
Definition of fl => (x e A fl B)or(x e A fl Q 

Definition of U => x e (A f| B) U (A fl Q 

and (b) (A fl B) U (A n C) c A fl (B U C) 



x e (A fl B) U (A n Q 
Why? 
Why? 
Why? 
Why? 

Proof Technique 2 . 



> (x 6 A fl B) or (x e A fl C) 

> (x 6 A and x e B) or (x e A and leC) 
• x e A and (x e B or x e C) 

>x&Af](BDC) m 



(1) To prove that A c B,we must show that if x e A, then x £ B. 

(2) To prove that A = B,we must show: 

(a) A cB, and 

(b) B cA. 

To further illustrate the Proof-by-Definition technique, let's prove the following: 

Theorem 4.1.2. Let A, B, and C be sets, then 
A x (B fl C) = (A x B) fl (A x C). 
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Commentary; We again ask ourselves: What are we trying to prove? What types of objects are we dealing with? We realize that we wish to 

prove two facts: (a) L.H.S. c R.H.S. and (b) R.H.S. c L.H.S. 

To prove part a, and similarly part b, we'll begin the same way. Let 

e LBS. to show e R.H .S. What should be? 

What does a typical object in the L.H.S. look like? 

Proof of Theorem 4.1.2: We must prove: 

(a)Ax(Bp| C)Q(AxB)f)(AxC). 

Let(x, >■) e A x (B f| C) to prove (x, y) e (AxB) f| (AxC). 

(x, y) eAx(B DQ. 

Why? => x e Aandj e (fifl Q. 

Why? => x e A and (v e £ andv e C). 

Why? => (x e Aandv e B)and(x e Aandv e C). 

Why? => (x, y) e (AxB)and(x, >-) e (AxC). 

Why? => (x, v) e (AxB)n(AxC). 

and(b) (Ax B) fl (AxC) c Ax(B f] C). 

Let(x, >■) e (AxB) fl (A x C) to prove (x, y) e A f| (BxC). 

(x, j) e (AxB)n(AxC). 

Why ? ^ (ij)eAxB and (x, >■) e A x C. 

Why? => (x e Aandv e B)and(x e Aandv e C). 

Why? => x e Aand(v e B andj e C). 

Why? => x e A and v e (B fl C). 

Why? => (x, y) e Ax(Bfl C) ■ 

EXERCISES FOR SECTION 4.1 
A Exercises 

1 . Prove the following: 

(a) Let A, B, and C be sets. If A c B and B c C, then A c C. 

(b) Let A and B be sets. Then A - B= A{~\B C . 

(c) Let A, B, and C be sets. If (A c B and A c C) then A c B f| C. 

(d) Let A and B be sets . A c B If and only if B c c A c . 

(e) LetA,B, and Cbe sets. If A c B then AxC c BxC. 

2. Write the converse of parts (a), (c), and (e) of Exercise 1 and prove or disprove them. 

3. Disprove the following, assuming A, B, and Care sets; 

(a) A - B = B - A. 

(b) AxB = BxA. 

(c) A fl B = A fl C implies B = C. 

4. Let A, B, and C be sets. Write the following in "if . . . then . . ." language and prove: 
(a) x e B is a sufficient condition for x e A \J B. 

(b)An#nC = 0isa necessary condition for A f] B =0. 
(c) A IJ B = B is a necessary and sufficient condition for A c B. 

B Exercises 

5. Prove by induction that if A, B { B 2 , . . . , B„, are sets, n > 2, then 

Afi(Bi ub 2 u • • -us*) = (a n«i) u (a nB 2 ) u • • -u(AnB n ) 
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I 4.2 Laws of Set Theory 



The following basic set laws can be derived using either the Basic Definition or the Set-Membership approach and can be illustrated by Venn 
diagrams. 

Commutative Laws 

(1) A U B = B U A (V) A n B = B f| A 

Associative Laws 

(2) A U (B U Q = (A U B) U C (2') A R (B D C) = (A ft B) f] C 

Distributive Laws 

(3) A n (B U Q = (A n B) U (A n O (3') A U (B n C) = (A U B ) n (AU C) 

Identity Laws 

(4) A U = U A = A. (4') Af]U = Ur\A = A 



Complement Laws 

(5) A[jA c = U (5 1 ) A n A c : 



Idem potent Laws 

(6) A IJ A = A (6') A Pi A = A 



Null Laws 

(7) A U U = U (T) A n = I 



Absorption Laws 

(8) A U (A n B) = A. (8') A n (A U B) = A. 



DeMorgan's Laws 

(9) (A IJ B) c = A c f] B c . (9') (A fl Bf = A c {J B c 



Involution Law 

(10) (A c ) c = A. 
It is quite clear that most of these laws resemble or, in fact, are analogues of laws in basic algebra and the algebra of propositions . 

PROOF USING PREVIOUSLY PROVEN THEOREMS 

Once a few basic laws or theorems have been established, we frequently use them to prove additional theorems. This method of proof is 
sometimes more efficient than that of Proof by Definition. To illustrate, let us prove the following: 

Theorem 4.2.1. Let A and B be sets. Then (A fl B) U (A fl B c ) = A. 

Proof : (A fl B) U (A fl B c ) = A fl (B U B c ) Why? 

= Af]U Why? 

= A Why? ■ 

PROOF USING THE INDIRECT METHOD/ CONTRADICTION 

The procedure one most frequently uses to prove a theorem in mathematics is the Direct Method, as illustrated in Theorems 4.1.1 and 4.1.2. 
Occasionally there are situations where this method is not applicable. Consider the following: 

Theorem 4.2.2. Let A, B, C be sets. IfAcBandBf]C = 0, then A f| C = 0. 

Commentary: The usual and first approach would be to assume AcB and B f] C = is true and to attempt to prove A f] C = is true. To 
do this you would need to show that nothing is contained in the set A f] C. Think about how you would show that something doesn't exist. It 
is very difficult to do directly. 
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The Indirect Method is much easier: If we assume the conclusion is false and we obtain a contradiction — then the theorem must be true. This 
approach is on sound logical footing since it is exactly the same method of indirect proof that we discussed in Section 3 .5 . 

Proof of Theorem 4.2.2; Assume A c B and B f] C = 0, and A f] C * 0. To prove that this cannot occur, let x e A f]C. 

Why? => x e A and x e C, 

Why? =* x e B and x e C. 

Why? =* lefiflC. 
But this contradicts the second premise. Hence, the theorem is proven. ■ 

EXERCISES FOR SECTION 4.2 

In the exercises that follow it is most important that you outline the logical procedures or methods you use. 

A Exercises 

1 . (a) Prove the associative law for intersection (Law 2') with a Venn diagram, 
(b) Prove DeMorgan's Law (Law 9) with a membership table. 

(c) Prove the Idempotent Law (Law 6) using basic definitions. 

2. (a) Prove the Absorption Law (Law 8') with a Venn diagram, 
(b) Prove the Identity Law (Law 4) with a membership table. 

(c) Prove the Involution Law (Law 10) using basic definitions. 

3. Prove, using the set theory laws, as well as any other theorems proved so far; 

(a) A U (B - A) = A U B 

(b) A - B = B c - A c . 

(c) AcB, Af)C ±0 ^ Bf)C ±0 

(d) a n (b - o = (a n b) - (a n o. 

(e) A - (B U Q = (A - B) (A - Q 

4. Use previously proven theorems to prove: 

(a) a n (b n cy = (a n b c ) u (a n c c ) 
o>) a n (b n (a n by) = 

(c) (A n B) U B c = A U B c 

(d) A U (B - C) = (A U B) - (C - A). 

5. Hierarchy of Set Operations. The rules that determine the order of evaluation in a set expression that involves more than one operation are 
similar to the rules for logic. In the absence of parentheses , complementations are done first, intersections second, and unions third. Parentheses 
are used to override this order. If the same operation appears two or more consecutive times, evaluate from left to right. In what order are the 
following expressions performed? 

(a) A U B c n C. 

(b) a n b u c n b. 

(c) A U B U C c . 

C Exercise 

6. There are several ways that can be used to format the proofs in this chapter. One that should be familiar to you from Chapter 3 is illustrated 
with the following proof. Alternate proof of part (a) in Theorem 4.1.1: 

(1) x e A Pi (B U Q Premise 

(2) (x e A) A (x e B \J C) (1), definition of intersection 

(3) (x e A) A(C* e B) \J (x e C)) (2), definition of union 

(4) (x e A) A (x e B) V {x e A) A (■* e C) (3), distribute A over V 
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(5) (x e A H B) V (x e A f] C) (4), definition of intersection 

(6) x e (A f]B) U (A P\C) (5), definition of union 
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I 4.3 Minsets 



Let fi[ and B 2 be subsets of a set A. Notice that the Venn diagram of Figure 4.3.1 is naturally partitioned into the subsets A\, A 2 , A 3 , and A 4 . 
Further we observe that A ( , A 2 , A 3 , and A 4 can be described in terms of B t and B 2 as follows: 

A, = B, U B 2 C 

a 2 = b, n B 2 

A 3 =B, c nB 2 
A 4 =B 1 r ns 2 c 




Figure 4.3.1 

Each A; is called a minset generated by B\ and B 2 . We note that each minset is formed by taking the intersection of two sets where each may 
be either B k or its complement B k c Note also, given two sets, there are 2 2 = 4 minsets. 
Minsets are occasionally called minterms. 

The reader should note that if we apply all possible combinations of the operations intersection, union, and complementation to the sets B\ and 
B 2 of Figure 4.3.1 , the smallest sets generated will be exactly the minsets, the minimum sets. Hence the derivation of the term minset. 

Next consider the Venn diagram containing three sets, B\, B 2 , and B 3 . What are the minsets generated by B\, B 2 , and B 3 ? How many are 
there? Following the procedures outlined above, we note that 

Bi n b 2 n bi 
Bj n Bi n b 3 
Bi n b 2 n b c 3 

are three of the 2 3 = 8 minsets. See Exercise 1 of this section. 



Pi D„, where each £),, may be 
-- {1, 3, 5) andB 2 = {1,2, 3). 



Definition: Minset. Let {B h B 2 ,... ,B,J be a set of subsets of a set A. Sets of the form Di f]D 2 f] 
either B t or B\ is called a minset generated by B,, B 2 ,... ,B„. 

Example 4.3.1. For another view, consider the following: Let A = {1, 2, 3, 4, 5, 6) with subsets B L 
How can we, using set operations applied to B\ and B 2 , produce a list of sets that contain elements of A efficiently without duplication? As a 
first attempt, we note that: 

B I nS 2 = {l,3}, 

B\ =(2,4,6) ,and 

B C 2 = {A, 5,6). 

We have produced all elements of A but we have 4 and 6 repeated in two sets. In place of B\ and B 2 , let us try B\ f] B 2 and B[ f] B 2 , 
respectively: 

B\ Pi B 2 = (2) and 

B l C\B\ = {5\. 

We have now produced the elements 1, 2, 3, and 5 using B t f] B 2 , B\ f] B 2 and Bj f] B 2 yet we have not listed the elements 4 and 6. Most 
ways that we could combine B t and B 2 such as B ( IJ B 2 or B t \J B 2 will produce duplications of listed elements and will not produce both 4 
and 6. However we note that B\ f] B 2 = {4, 6), exactly the elements we need. Each element of A appears exactly once in one of the four 
minsets B t f] B 2 , B\ f] B 2 , B[ f] B 2 and B\f]B 2 . Hence, we have a partition of A. 

Theorem 43.1. Let A be a set and let B h B 2 ... , B„ be subsets of A. The set of nonempty minsets generated by B h B 2 ... , B„ is a 
partition of A. 
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The proof of this theorem is left to the reader. The most significant fact about minsets is that any subset of A that can be obtained from B[ , B 2 
... , B„ , using the standard set operations can be obtained in a standard form by taking the union of selected minsets . 

Definition: Minset Normal Form. A set is said to be in minset normal form when it is expressed as the union of zero or more distinct 
nonempty minsets. 

Notes: 

(a) The union of zero sets is the empty set, 0. 

(b) Minset normal form is also called canonical form. 

Example 4.3.2. Let V = (-2,-1,0, 1,2},B,= (0, l,2),andB 2 = (0,2). Then 
B, n B 2 = (0, 2) 

b\ n B 2 = 

B,C\B\ = (1) 

B\C\B\ = (-2,-1) 

In this case, there are only three nonempty minsets, producing the partition ((0, 2), (1), (-2, -1)). An example of a set that could not be 
produced from just Bj and B 2 is the set of even elements of U, (-2, 0, 2). This is because -2 and -1 cannot be separated — they are in the 
same minset and any union of minsets needs either include or exclude them both. In general, there are 2 3 = 8 different minset normal forms 
because there are three nonempty minsets. This means that only 8 of the 2 5 = 32 subsets of U can be generated from B ( and B 2 . 

EXERCISES FOR SECTION 4.3 
A Exercises 

1. Consider the subsets A = (1, 7, 8),B = (1, 6, 9, 10),andC = (1, 9, 10), where U = (1,2, ..., 10). 

(a) List the nonempty minsets generated by A, B, and C. 

(b) How many elements of the power set of U can be generated by A, B, and CI Compare this number with \'P(U) |. Give an example of one 
subset that cannot be generated by A, B, and C. 

2. (a) Partition {1,2, ....9} into the minsets generated by B l = (5, 6,7), B 2 = (2, 4, 5, 9), andB 3 = (3, 4, 5, 6, 8, 9). 
(b) How many different subsets of ( 1 , 2, . . . , 9) can you create using B, , B 2 , and B 3 with the standard set operations? 

(c) Do there exist subsets C\ , C 2 , C 3 whose minsets will generate every subset of ( 1 , 2, . . . , 9)? 

3. Partition the set of strings of 0's and l's of length two or less, using the minsets generated by B] = {s \ s has length 2), and 
B 2 = {s I s starts with a 0). 

4. Let Bi, B 2 , and B 3 be subsets of a universal set U, 

(a) Find all minsets generated by B { , B 2 ,andB 3 . 

(b) Illustrate with a Venn diagram all minsets obtained in part (a). 

(c) Express the following sets in minset normal form: B, , Bi f]B 2 ,Bi {J B 2 . 

5. (a) Partition A = (0, 1, 2, 3, 4, 5) with the minsets generated by B x = (0, 2, 4) and B 2 = (1, 5). 
(b) How many different subsets of A can you generate from B[ andB 2 l 

B Exercises 

6. If (Bi, B 2 , ..., B„) is a partition of A, how many minsets are generated by Bi, B 2 , ..., B„? 

7 . Prove Theorem 4.3.1. 

C Exercise 

8. Let S be a finite set of n elements. Let B; „ i = 1, 2, ... , k be nonempty subsets of S. There are 2 2 minset normal forms generated by the k 
subsets. The number of subsets of 5 is 2". Since we can make 2 2 > 2" by choosing k > log 2 n, it is clear that two distinct minset normal-form 
expressions do not always equal distinct subsets of S. Even for k < log 2 n, it may happen that two distinct minset normal-form expressions 
equal the same subset of S. Determine necessary and sufficient conditions for distinct normal-form expressions to equal distinct subsets of S. 
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I 4.4 The Duality Principle 



In Section 4.2 we observed that each of the set laws labeled 1 through 9 had an analogue 1' through 9'. We notice that each of the laws in one 
column 2 can be obtained from the corresponding law in the other column by replacing IJ by f], (f) by IJ. by U, U by 0, and leaving the 
complement unchanged. 

Definition: Duality Principle for Sets. Let S be any identity involving sets and the operations complement, intersection and union, . If S* 
is obtained from S by making the substitutions IJ -> f], f) ->{J , -» U , and U -» 0, then the Statement S* is also true and it is called the 
dual of the Statement S. 

Example 4.4.1. The dual of (A f] B) U (A f] B c ) = A is (A U B) fl (A U B c ) = A 

One should not underestimate the importance of this concept. It gives us a whole second set of identities, theorems, and concepts. For example, 
we can consider the dual of minsets and minset normal form to obtain what is called maxsets and maxset normal form. 

EXERCISES FOR SECTION 4.4 
A Exercises 

1 . State the dual of: 

(a) A U (B fl A) = A. 

(t>) a u m u a) n by = u, 

(c) (A U B c ) c n B = A c n B 

2. Consider Table 3.4.1 and then write a description of the principle of duality for logic. 

3 . Write the dual of: 

(a) p V - ((- q V P) A q) *=> 1 

(b) (-(/> A (-■«)) V «**(-'/» V q). 

B Exercises 

4. Use the principle of duality and the definition of minset to write the definition of maxset. {Hint, just replaceHby U .) 

5. Let A = {1,2, 3,4, 5, 6} and let B t = {1, 3, 5)and£ 2 = {1,2, 3). Find the maxsets generated by B[ and B 2 . Note the set of maxsets 
does not constitute a partition of A. Can you explain why? 

(a) Write out the definition of maxset normal form. 

(b) Repeat Problem 4 of Section 4.3 for maxsets. 

6. Is the dual of Exercise 5 of Section 4.1 true? 
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| SUPPLEMENTARY EXERCISES FOR CHAPTER 4 



Section 4.1 

1 . Let A and B be subsets of a universal set U. Use basic definitions to prove each statement. Be sure to outline carefully the logical structure 
of the proof. 

(a) AqS if and only if A f) {U - B) = 0. 

(b) If U = A IJ B and A f] B = 0, then A = U - B. 

(c) A and B are disjoint if and only if A c U — B. 

2. Let A and 6 be subsets of a universal set U. Verify the statements using a Venn diagram. 

(a) AqB if and only if A f| B = A. 

(b) AcBifandonlyifA \J B = B. 

(c) A n 5 c A U B. 

3. (a) Prove that if A, B, and C are sets, then (A IJ B) x C is a subset 
of (AxC) U (BxC). 

(b) Explain how you would proceed following part a if you wanted to prove that (A IJ B)xC = (AxC) J (BxC). 

4. Simplify the following: 

(a) (A n B c n C) J B J (B n C) J (A r n C). 

(b) (a j (b n c)) n «a c n b c ) j q. 

Section 4.2 

5. Prove with an indirect proof ihat if A, B, and Care subsets of universe [7, A is a subset of B, and B is a subset of C, then C c is a subset of A c . 

6. Basic laws and theorems in different algebraic structures can be recalled easily by thinking in terms of analogous situations in elementary 
algebra. 

(a) Complete the following table: 





Algebra 
of Sets 


Algebra 
of Logic 


Elementary 
Algebra 


Objects Used 
Basic Operations 


Sets 

u 


A 


Real Numbers 








- (or multiplicative 


Other Connectives 


c 


<=> 


inverse 



(b) Write analogous statements in the algebras of sets and logic. 

Which are true? 

(i) If x, y, and z are real numbers and if x < y and y ^ z, then x ^ z. 

(ii) -(— x) = x for any real number x. 

(iii) If x , y, and z are real numbers and x + y = x + z, then y = z . 

(iv)Letx, y e R.x = y if and only if x ^ y and y ^ x. 

(v) For x e IR , x + = + x = x and x ■ 1 = 1 • x = x. 

1. Prove or disprove: 

(a) LetA,B,andCbesets.IfA J C t B J C.thenA t B. 

(b) If A * B, then A' =/=B c . 
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8. Let {Ai, A 2 , . . .A„) be a partition of set A, and let B be any nonempty subset of A, Prove that {A; f] B | A; f] B £ 0) is a partition of A f] B. 

Section 4.3 

9. Let U = {1, 2, 3, 4, 5, 6},Bi = (1, 3),andB 2 = {2, 3, 5). 

(a) List the minsets generated by B[ and B 2 . 

(b) Show that the set of minsets form a partition of U. 

Section 4.4 

10. State the dual of each statement in Exercise 3, Section 4.2. 

1 1 . Show that the dual of each of the basic set laws 1 through 9 are the corresponding laws 1' through 9'. 
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chapter 5 




INTRODUCTION TO MATRIX ALGEBRA 

GOALS 

The purpose of this chapter is to introduce you to matrix algebra, which has many applications. You are already familiar with several algebras: 
elementary algebra, the algebra of logic, the algebra of sets. We hope that as you studied the algebra of logic and the algebra of sets, you 
compared them with elementary algebra and noted that the basic laws of each are similar. We will see that matrix algebra is also similar. As in 
previous discussions, we begin by defining the objects in question and the basic operations. 



I 5.1 Basic Definitions 



Definition: Matrix. A matrix is a rectangular array of elements of the form 



"11 


"l2 


"ll ■ 


■■ "in 


"21 


"22 


"21 ■ 


■■ "2n 


<*31 


"l2 


"ll ■ 


■■ "in 



, "m\ "m2 "ml '" "m 

A convenient way of describing a matrix in general is to designate each entry via its position in the array. That is, the entry a 34 is the entry in 
the third row and fourth column of the matrix A. Depending on the situation, we will decide in advance to which set the entries in a matrix will 
belong. For example, we might assume that each entry a V] (1 < i < m, 1 < j < n) is a real number. In that case we would use M„, X „(IR) to 

stand for the set of all m by n matrices whose entries are real numbers . If we decide that the entries in a matrix must come from a set 5, we use 
M mx „(S) to denote all such matrices. 

Definition: Order of a Matrix. The matrix A that has m rows and n columns is called an mxn (read "m by n") matrix, and is said to 
have order mxn. 

Since it is rather cumbersome to write out the large rectangular array above each time we wish to discuss the generalized form of a matrix, it is 

common practice to replace the above by A = [fly]. In general, matrices are often given names that are capital letters and the corresponding 

lower case letter is used for individual entries. For example the entry in the third row, second column of a matrix called C would be C32. 

Example 5.1.1. 









C > 




n 


2 5 } 


> 


-s \ 
















,B = 




,and£> = 


6 


-2 3 


u 


-3 




2 

, 15, 




u 


2 8 , 



are 2 x 2 , 3 x 1 , and 3x3 matrices , respectively 

Since we now understand what a matrix looks like, we are in a position to investigate the operations of matrix algebra for which users have 
found the most applications. 



Example 5.1.2. First we ask ourselves: Is the matrix A 

1 2 



1 2 
3 4 



the matrix B 



3 5 



equal to 
? No, they are not because the corresponding entries in the second row, second column of the two matrices are not 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3 .0 United States 



91 91 

Chapter 5 - Matrix Algebra 



(l 2 3 \ 1 1 2 \ , ,. . . , ., . , 

equal. Next, is A = equal to B = I ? No, although the corresponding entries in the first two columns are identical, B doesn't 

have a third column to compart to that of A. We formalize these observations in the following definition. 

Definition; Equality of Matrices . A matrix A is said to be equal to matrix B (written A = B) if and only if: 

(1) A and B have the same order, and 

(2) all corresponding entries are equal: that is, ay = by for all appropriate i andj. 

I 5.2 Addition and Scalar Multiplication 

Example 5.2.1. Concerning addition, it seems natural that if 

(I \ / 3 4 \ / 1 +3 0+4 \ / 4 4 

/ 1 2 3 \ (3 0\ . , . .,., 

However, if A = and B = I , can we find A + Bl No, the orders of the two matrices must be identical. 

Definition: Matrix Addition. Let A and B be mxn matrices. Then A + B is an m xn matrix where (A + B)y = ay + by (read "the ith 

jth entry of the matrix A + B is obtained by adding the ith jth entry of A to the ith jth entry ofB"). If the orders of A and B are not identical, 
A + B is not i ' 



It should be clear from Example 5.2.1 and the definition of addition that A + B is defined if and only if A and B are of the same order. 

Another frequently used operation is that of multiplying a matrix by a number, commonly called a scalar in this context. Scalars normally come 
from the same set as the entries in a matrix. For example, if A e M,„ X „(R), a scalar can be any real number. 

/ 1 -2 \ 
Example 5.2.2. If c = 3 and if A = and we wish to find cA, it seems natural to multiply each entry of A by 3 so that 

,, /3 -6\ . , ,.. , 

3 A = , and this is precisely the way scalar multiplication is defined. 

Definition: Scalar Multiplication. Let A be an m x n matrix and c a scalar. Then c A is the mxn matrix obtained by multiplying c times 
each entry of A: that is (cA)y = c ay. 



I 5.3 Multiplication of Matrices 



I - For a video introduction to this section, go to http://faculty.uml.edu/klevasseur/ads2/videos/matrixmultiplication/ 

A definition that is more awkward to motivate (and we will not attempt to do so here) is the product of two matrices. In time, the reader will see 
that the following definition of the product of matrices will be very useful, and will provide an algebraic system that is quite similar to elemen- 
tary algebra. 

Definition: Matrix Multiplication. Let A be an m xn matrix and let B be an n x p matrix. The product of A and B, denoted by AB, is an 
mxp matrix whose ith row jth column entry is 

(AB)ij = a n b,j +a i2 b 2j + ■■■ +a in b nj 

n 

= Y,a ik b kj 
k=l 

for 1 < i < m and 1 < j < p. 

The mechanics of computing one entry in the product of two matrices is illustrated in Figure 5.3.1 . 
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Row J— - 



r 

1 - 1 

L _ J 




Col. 2 
\ 

r ' i 

-6 : 2 : 4 


-I 2-1 




3 ! 3| 6 


1 I 




I • 4 ! 5 



Row I, Col. 2 of Product 

I 

* (D<2) + C -1><3> + (0K4) * 



Figure 5.3.1 Computation of one entry in the product of two 3 by 3 matrices 

The computation of a product can take a considerable amount of time in comparison to the time required to add two matrices. Suppose that A 
and B are nxn matrices; then (AB), ; is determined performing n multiplications and n - 1 additions. The full product takes n 3 multiplications 

and n 3 — n 2 additions. This compares with n 2 additions for the sum of two nxn matrices. The product of two 10 by 10 matrices will require 
1 ,000 multiplications and 900 additions, clearly a job that you would assign to a computer. The sum of two matrices requires a more modest 
100 additions. This analysis is based on the assumption that matrix multiplication will be done using the formula that is given in the definition. 
There are more advanced methods that, in theory, reduce operation counts. For example, Strassen's algorithm 

(http://mathworld.wolfram.com/StrassenFormulas.html) computes the product of two n by n matrices in 7 -l' ^" - 6- 4 lo S2" « 7 n 2 - 808 opera- 
tions. There are practical issues involved in actually using the algorithm in many situations. For example, round-off error can be more of a 
problem than with the standard formula. 

1 ^ 



Example 5.3.1. Let A 



, a 3 x 2 matrix, an let B 



, a 2 x 1 matrix. Then A B is a 3 X 1 matrix: 



/ 1 > 

3 2 


(?)-( 


1-51; 



1-6 + 01 } 




f 6 ) 


2-1+3-6 


= 


20 


5 -6 + 1-1 J 




1-29 J 



AB 



Remarks: 

(1) The product AB is defined only if A is an mxn matrix and B is an nxp matrix; that is, the two "inner" numbers must be the equal. 
Furthermore, the order of the product matrix AS is the "outer" numbers, in this case mxp. 

(2) It is wise to first determine the order of a product matrix. For example, if A is a 3x2 matrix and B is a 2x2 matrix, then AB is a 3x2 
matrix of the form 

'en c 12 
AB = c 2 i c 2 2 
,c 3 i c 32 ; 

Then to obtain, for example, C31 , we multiply corresponding entries in the third row of A times the first column of B and add the results. 
Example 5.3.2. 
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,10, 
Lei ,1 =| Q 3 |, and B 



AB 



3 
2 1 

1-3 + 0-2 1-0 + 0-1 
0-3 + 3-2 0-0 + 3-1 

3 



. Then 



3 
6 3 



Note: B A = \ * AB 

Remarks; 

(1) An nXn matrix is called a square matrix. 

(2) If A is a square matrix , A A is defined and is denoted by A 2 , and AAA = A 3 , etc . 

(3) The mxn matrices whose entries are all are denoted by mx „ , or simply 0, when no confusion arises regarding the order. 



, and C -- 



1 -1 

3-2 2 



:A + cB. 



EXERCISES FOR SECTIONS 5.1 THROUGH 5.3 
A Exercises 

/ 1 -1 \ (0 1 

l.LetA = „ „ , B = I r 

\2 3 ) 1 3 -5 

(a) Compute A B and B A. 

(b) Compute A + B and B + A. 

(c) Ifc = 3, show that c(A + B) = 

(d) Show that (AB)C=A(fiC). 

(e) Compute A 2 C. 

(f) Compute B + 

(g) Compute A 2x 2 and 2X 2 A. where 2x 2 is the 2 x 2 zero matrix, 
(h) Compute A, where is the real number (scalar) zero. 
(i)Letc = 2 and d = 3.Showthat(c + d) A = cA + dA. 







' 1 2 > 




( 2 3 -i 




f 2 1 2 3) 


2. Let A = 


2-15 


,B = 


1 1 2 


, and C = 


4 11 




,3 2 1 , 




, -1 3 -2, 




,3-141, 


Compute, if possible; 








(a) A - B 








(b) AB 








(c) AC - BC 








(d) A(BQ 








(e) CA - CB 










( x - 








(f) c 


y 

z 












,w , 













3. Let A 



2 
3 



. Find a matrix B such that AB = I and B A = I, where / = 



1 
1 



4. Find A / and B / where / is as in Exercise 3, where 
A = { „ , I and B 



1 8 
,9 5 
What do you notice? 



-2 3 
5 -7 
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5. Find A 3 if A: 



'10 

2 
,0 3 I 



. What is A 15 equal to? 



B Exercises 

(1 o i 
6. (a) Determine I 2 and I 3 if / = 1 . 

,001, 

(b) What is /" equal to for any n>\l 

(c) Prove your answer to part (b) by induction. 



7. (a) If A 



2 1 



X 



x\ 



andB 



1 -1 ) U2/ V 1 

(b) Express the following systems of equations using matrices: 
(i) 2*i- x 2 = 4 

Xl + X2 = 

(ii) X| + ^2 + 2x3 = 1 

X| + 2X2 _ X-i = -1 

X[ + 3 X2 + X3 = 5 
(iii) Xi + X2 =3 

X2 =5 

X[ + 3 X3 = 6 



show that AX = B is a way of expressing the system 2xi + X2 = 3 using matrices . 

X\- Xi= 1 
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I 5.4 Special Types of Matrices 



We have already investigated one special type of matrix, namely the zero matrix, and found that it behaves in matrix algebra in an analogous 
fashion to the real number 0; that is, as the additive identity. We will now investigate the properties of a few other special matrices. 

Definition: Diagonal Matrix. A square matrix D is called a diagonal matrix ifdy = whenever i # j. 



are all diagonal matrices. 



In Example 5.4.1, the 3x3 diagonal matrix / whose diagonal entries are all l's has the singular property that for any other 3x3 matrix A we 
have A I = I A = A. For example: 



ampl 


5 5.4.1. 












t 1 > 




(3 \ 


M 0i 


A = 


2 


, B = 





, and / = 


1 




(0 5 i 




(0 -5 J 




l0 1; 



Example 5.4.2. If A 



AI 







< 1 2 


If A 


= 


6 7 

,3 - 


< 1 


2 5 ^ 


6 


7 -2 


1,3 


-3 J 


< 1 


2 5 \ 


6 


7 -2 


(3 


-3 


) 



5 ^ 



-3 



and 



then 



IA 



In other words, the matrix / behaves in matrix algebra like the real number 1; that is, as a multiplicative identity. In matrix algebra the matrix / 
is called simply the identity matrix. Convince yourself that if A is any nxn matrix A / = I A = A. 

Definition: Identity Matrix. The nxn diagonal matrix whose diagonal components are all l's is called the identity matrix and is 
denoted by I or I n . 

In the set of real numbers we realize that, given a nonzero real number x, there exists a real number y such that xy = y x = \. We know that 

real numbers commute under multiplication so that the two equations can be summarized as x y = 1 . Further we know that y = x~ ' = - . Do 

we have an analogous situation in M nxn (R)7 Can we define the multiplicative inverse of an nxn matrix A? It seems natural to imitate the 
definition of multiplicative inverse in the real numbers . 

Definition: Matrix Inverse. Let Abe an nxn matrix. If there exists an nxn matrix B such that AB = B A = I, then B is the multiplica- 
tive inverse of A (called simply the inverse of A) and is denoted by A~ l (read "A inverse ") . 

When we are doing computations involving matrices, it would be helpful to know that when we find A -1 , the answer we obtain is the only 
inverse of the given matrix. 

Remark: Those unfamiliar with the laws of matrices should go over the proof of Theorem 5.4.1 after they have familiarized themselves with the 
Laws of Matrix Algebra in Section 5 .5 . 

Theorem 5.4.1. The inverse of an nxn matrix A, when it exists, is unique. 

Proof: Let A be an n x n matrix. Assume to the contrary, that A has two (different) inverses, say B and C. Then 



B = Bl 

= B(AC) 
= (BA)C 
= IC 

= c 

Example 5.4.3. Let A = 



Identity property of / 
Assumption that C is an inverse of A 
Associativity of matrix multiplication 
Assumption that B is an inverse of A 
Identity property of / ■ 



2 
3 



What is A ' ? Without too much difficulty, by trial and error, we determine that A 



2 



1 



This 



might lead us to guess that the inverse is found by taking the reciprocal of all nonzero entries of a matrix. Alas, it isn't that easy! 

' ' \\ 

2 
1 1 



If 



1 2 

-3 5 



, the "reciprocal rule" would tell us that the inverse of A is B •■ 



\ 3 



Try computing A B and you will see that you don't get 



the identity matrix. So, what is A -1 ? In order to understand more completely the notion of the inverse of a matrix, it would be beneficial to 
have a formula that would enable us to compute the inverse of at least a 2x2 matrix. To do this, we need to recall the definition of the determi- 
nant of a 2x2 matrix. Appendix A gives a more complete description of the determinant of a 2x2 and higher-order matrices. 

I a b\ 
Definition: Determinant of a 2 x2 Matrix. Let A =\ \. The determinant of A is the number det A = ad -be. 

\c d I 

In addition to det A, common notation for the determinant of matrix A is \A\. This is particularly common when writing out the whole matrix, 
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which case we would write 
Example 5.4.4. 

1 2 



a b 
c d 



for the determinant of the general 2x2 matrix. 



If A = 
IfB = 



-3 5 

1 2 

2 4 



then det A 
then det B = 



1-5 -2- (-3) = 11. 



1-4 -2-2 = 



a b 

c d 



l ( d -b 

det A \ -C a 



Theorem 5.4.2. Let A =| " ". |. If det A t 0, then A ' 

Proof: See Exercise 4 at the end of this section. 

Example 5.4.5. Can we find the inverses of the matrices in Example 5.4.4? 



If A 



1 2 

-3 5 



then A 



j_/5 -2 
ii V 3 1 



/. 



n 

ii 



_2_ -* 

ii 



The reader should verify that A A ' = A ' A 

The second matrix, B has a determinant equal to zero. We we tried to apply the formula in Theorem 5.4.2, we would be dividing by zero. 

For this reason, the formula can't be applied and in fact ZT 1 does not exist. 

Remarks: 

(1) In general, if A is a 2x2 matrix and if det A = 0, then A~ l does not exist. 

(2) A formula for the inverse oinxn matrices n > 3 can be derived that also involves det A , Hence , in general , if the determinant of a matrix is 
zero, the matrix does not have an inverse. However the formula for even a 3 x 3 matrix is very long and is not the most efficient way to 
compute the inverse of a matrix. 

(3) In Chapter 12 we will develop a technique to compute the inverse of a higher-order matrix, if it exists. 

(4) Matrix inversion comes first in the hierarchy of matrix operations ; therefore , A B~ ' is A(B~ ' ) . 

EXERCISES FOR SECTION 5.4 
A Exercises 

1. For the given matrices A find A -1 if it exists and verify that AA~ [ = A~ [ A = I If A -1 does not exist explain why. 

1 3 
,2 1 

6-3 

8 -4 

1 -3 
,0 1 

1 
,0 1 

(e) Use the definition of the inverse of a matrix to find A~' : 

(3 ^ 

0-0 

2 

{0 -5 ) 

2. For the given matrices A find A~' if it exists and verify that AA~ [ = A~' A = I If A -1 does not exist explain why. 



(a) A = 

(b) A 

(c) A 

(d) A 



(a) A = 

(b) A = 

(c) A = 



2 -1 
-1 2 

1 

2 

1 c 
1 
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I a b \ 
(d) A = , , were a > b>0. 

\b a ) 

3. (a)LetA = ( ]andB=[ ~ J. Verify that (A B)" 1 = B~ l A" 1 . 

(b) Let A and B be nxn invertible matrices. Prove that (AB) _1 = B~ [ A~ [ . Why is the right side of the above statement written "backwards"? 
Is this necessary? Hint: Use Theorem 5.4.1. 

B Exercises 

I a b \ _, 

4. Let Let A = , . Derive the formula for A . 

V c d I 

5. (a) Let A and B be as in problem 3 above. Show that det(A B) = (detA) (detB). 

(b) It can be shown that the statement in part (a) is true for all nxn matrices. Let A be any invertible nxn matrix. Prove that 
det(A _1 ) = (det A)~' . Note: The determinant of the identity matrix /„ is 1 for all n, see Appendix A for details, 
(c) Verify that the equation in part (b) is true for the matrix in exercise 1(a) of this section. 

I a \" la" 

6. Prove by induction that for n > 1, I „ , = I „ ,„ 

V b ) \ b 

7. Use the assumptions in exercise 5 to prove by induction that if n > 1, det(A") = (detA)". 

8 . Prove: If the determinant of a matrix A is zero , then A does not have an inverse . Hint: Use the indirect method of proof and exercise 5 . 

C Exercise 

9. (a) Let A, B, and/J be nxn matrices. Assume that B is invertible. If A = B DB [ , prove by induction that A m = B D m B~ l is true for 
m > 1. 

(b) Given that A = [ ~ ) = b( ]s- 1 whereB = [ j what is A 10 ? 
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I 5.5 Laws of Matrix Algebra 



The following is a summary of the basic laws of matrix operations. Assume that the indicated operations are defined; that is, that the orders of 
the matrices A, B, and C are such that the operations make sense. 



(1) A + B = B + A 


(2) A + (B + C) = (A 


+ B) + C 




(3) c(A + B) = cA + 


c B, where c e 


R. 


(4) (ci + c 2 )A = c\ A 


+ t'2 A, where c\ , ci e R. 


(5) C\(c2A) = (ci -c<i ) A, where c\, c 2 


e R. 


(6) OA = 0, where is i 


the zero matrix. 




(7) OA = 0, where on 


the left is the number 0. 


(8) A + = A. 


(9) A + (-l)A = 0. 


(10) A(B + C) = AB 


+ AC. 




(11) (B + C)A = BA 


+ CA. 




(12) A(BC) = (AB)C. 


(13) I A = A and AI = 


A. 




(14) If A" 1 exists. (A" 1 )" 


1 =A 




(15) If A" 1 andfi- 1 exist 


, (AB)- [ = B-' 


A- 1 



Example 5.5.1. If we wished to write out each of the above laws more completely, we would specify the orders of the matrices. For 
example. Law 10 should read: 

(10) Let A, B, and Cbe mxn,nxp, and nxp matrices, respectively, then A (S + C) = AB + AC 
Remarks: 

(1) Notice the absence of the "law" AB = BA. Why? 

(2) Is it really necessary to have both a right (No. 1 1) and a left (No. 10) distributive law? Why? 

(3) What does Law 8 define? What does Law 9 define? 

EXERCISES FOR SECTION 5.5 
A Exercises 

1 . Rewrite the above laws specifying as in Example 5 .5 . 1 the orders of the matrices . 

2. Verify each of the Laws of Matrix Algebra using examples. 
1 2 \ / 3 7 6 \ / -2 



3. Let A = ,fi= Land C= ]. Compute the following as efficiently as possible by using any of the Laws of 

Matrix Algebra: 

(a) AB + AC 

(b) A-> 

(c) A(B + C) 
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(d) (A 2 )" 1 

(e) (C + Br 1 A" 1 

4. Let A =1 andfi =1 I. Compute the following as efficiently as possible by using any of the Laws of Matrix Algebra: 

(a) AB 

(b) A + B 

(c) A 2 + AB + BA + B 2 

(d) fl-'A-' 

(e) A 2 + AB 

5 . Let A and Bbsnxn matrices of real numbers .Is A 2 - B 2 = (A- B)(A + B)l Explain 
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I 5.6 Matrix Oddities 



We have seen that matrix algebra is similar in many ways to elementary algebra. Indeed, if we want to solve the matrix equation AX = B for 
the unknown X, we imitate the procedure used in elementary algebra for solving the equation ax - b. Notice how exactly the same properties 
are used in the following detailed solutions of both equations. 

Solution of a x = b Solution of A X = B 



ax = b 

a' 1 (ax) = cT 1 b if a =/= 

(a~ [ a)x = a~ ] b associative law 

1 x = a -1 b definition of inverse 

x = a -1 b identity property of 1 



AX = B 

A- 1 (AX) = A- 1 B if A- 1 exists 

(A~ [ A)X = A~ [ B associative law 

IX = A~ l B definition of inverse 

X = A~ [ B identity property of / 



Certainly the solution process for AX = B is, the same as that of a x = b. 

The solution of xa = b is x = fcor 1 = a -1 b. In fact, we usually write the solution of both equations as x = -. In matrix algebra, the solution of 

X A = B is X = B A , which is not necessarily equal to A~ l B. So in matrix algebra, since the commutative law (under multiplication) is not 
true, we have to be more careful in the methods we use to solve equations. 

It is clear from the above that if we wrote the solution of A X = B as X = — , we would not know how to interpret the answer - . Does it mean 

A~ ' B or B A" ' ? Because of this , A" ' is never written as - . 



Some of the main dissimilarities between matrix algebra and elementary algebra are that in matrix algebra: 

( 1 ) A B may be different from B A . 

(2) There exist matrices A and B such that A B = 0, and yet A =/= and B t 0. 

(3) There exist matrices A where A 4= 0, and yet A 2 = 0. 

(4) There exist matrices A where A 2 - A with A + I and A t 

(5) There exist matrices A where A 2 = / , where A =£ I and A + -I 



EXERCISES FOR SECTION 5.6 
A Exercises 

1 . Discuss each of the above "oddities" with respect to elementary algebra. 

2. Determine 2x2 matrices which show each of the above "oddities" are true. 
B Exercises 

3. Prove the following implications, if possible: 

(a) A 2 = A and det A + => A = / 

(b) A 2 = /anddetA t => A = /orA = -/. 

4. Let M, !X „(IR) be the set of real nXn matrices. Let P c M, !Xn (IR) be the subset of matrices defined by A e P if and only if A 2 = A. Let 
Q c P be defined by A e Q if and only if det A * 0. 

(a) Determine the cardinality of Q. 

(b) Consider the special case n = 2 and prove that a sufficient condition for A e P c M 2X 2(IR) is that A has a zero determinant (i.e., A is 
singular) and tr (A) = 1 where tr (A) = a 1 1 + a 22 is the sum of the main diagonal elements of A . 

(c) Is the condition of part b a necessary condition? 
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C Exercises 




5 . Write each of the following systems in the form A X 


= B, and then solve the systems using matrices 


(a) 1x\ + X2 = 3 




X\ — X2 — 1 




(b) 2 xi - x 2 = 4 




Xi - X2 = 




(c) 2 X\ + X2 = 1 




JC] - X2 = 1 




(d) 2 xi + x 2 = 1 




X\ - X2= -1 




(e) 3 X\ + 2 X2 = 1 




6xi + 4x2 = ~1 





6. Recall that p (x) = x — 5 x + 6 is called a polynomial, or more specifically, a polynomial over R, where the coefficients are elements of R 
and x e R. Also, think of the method of solving, and solutions of, x 2 - 5 x + 6 = 0. We would like to define the analogous situation for 2x2 
matrices. First define where A is a 2x2 matrix p(A) = A 2 - 5 A + 61. Discuss the method of solving and the solutions of 
A 2 - 5A + 67 = 0. 

7. (For those who know calculus) 

(a) Write the series expansion for e " centered around a = 0. 

(b) Use the idea of exercise 6 to write what would be a plausible definion of e A where A is an n X n matrix. 

(c) If A = I and B = „ „ , use the series in part (b) to show that e A = I „ and e B = I „ 

(d) Show that e A e B t e B e A 

(e) Show that e A+B 



(f) Is e A e B 



A+B 1 
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| SUPPLEMENTARY EXERCISES FOR CHAPTER 5 



Sections 5.1 through 5.3 

1 . Determine x and y in the following: 



( 1 



2. Let A: 



-1 \ 



B 



to 


2 \ 




r3 ^ 


3 -1 


1 


,c = 


4 


U 2 


-1 J 




v5 J 



x + j 5 
-2 x - y 

Compute: 



3 5 
-2 4 



2 1 5 

3-4 i ; 

(a)2A-3B 
(b)2A-5A 
(c) AC + BC 

3 . Let A and £ be two mxm matrices with AB = BA . Prove by induction on n that AB" = B" A for n greater than or equal to 1 . 

4. Prove by induction that if n is a positive integer, and 



1 1 


1 


\ 





1 


1 


lo 





1 J 



, then A" •■ 



'1 n n(n-\)l2 
1 n 

,0 1 



Section 5.4 

5. Determine A~' A 3 if A : 

4 -2 



2 3 
1 4 

2 



6.LetA = ^_ 2 5 jandB = u J 

Compute A + B, A 2 + AB +BA + B 2 , and B l A~' . You may save some time by thinking before plunging into the computations. 

7. For what real number c will the matrix D have no inverse? Explain your answer. 
(3 15 



8. LetP: 



a b 



'■ ^2x2(K) ad * be 



Fact: The inverse of a diagonal matrix belonging to P can be found simply by reciprocating the diagonal elements of the matrix. 

3 ^' 



(a) Determine 

(b) Suppose 



6 

a b 
c d 



;Pand 



a b 



l/a b 

c l/d 



a b 



a b 



In general, is a diagonal matrix? If yes, explain why; if no, give the most general form of such a matrix . 

Section 5.5 

9 . (a) Let A and Bbenxn matrices . Expand (A + B) 2 . 
(b) Is (A + Bf ever equal to A 2 + 2 AB + B 2 ? Explain. 

10. Solve the following matrix equation for X. Be careful to explain under which conditions each step is possible. 

AX + C = BX 

Section 5.6 

1 1 . Prove or disprove: A~' = A and B l = B => (ABy 1 = AB. 

12. The following is true for all real numbers a and b: a ■ b = if and only if a = or b = 0. Is any part of this statement true for n x n matrices A 
and fi? Explain. Give an example and proof. 



13. Let A = | ], where a, b, c, d e R. Show that the matrices of the form A : 



1 
c -1 



and A = 



1 b 
-1 



are also solutions to the 



equation A 2 = /, confirming that a quadratic matrix equation can have an infinite number of solutions. Are there any others? 
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RELATIONS AND GRAPHS 



GOALS 

One understands a set of objects completely only if the structure of that set is made clear by the interrelationships between its elements. For 
example, the individuals in a crowd can be compared by height, by age, or through any number of other criteria. In mathematics, such compar- 
isons are called relations. The goal of this chapter is to develop the language, tools, and concepts of relations. 



I 6.1 Basic Definitions 



In Chapter 1 we introduced the concept of the Cartesian product of sets. Let's assume that a person owns three shirts and two pairs of slacks. 
More precisely, let A = (blue shirt, tan shirt, mint green shirt) and B = {grey slacks, tan slacks). Then certainly A x B is the set of all possible 
combinations (six) of shirts and slacks that the individual can wear. However, the individual may wish to restrict himself or herself to combina- 
tions which are color coordinated, or "related." This may not be all possible pairs in AxB but will certainly be a subset of AxB. For example, 
one such subset may be {(blue shirt, grey slacks), (blue shirt, tan slacks), (mint green shirt, tan slacks)). 

Definition: Relation. Let A and B be sets. A relation from A into B is any subset of AxB. 

Example 6.1.1. Let A = {1, 2, 3)andS = {4, 5). Then {(1, 4), (2, 4), (3, 5)) is a relation from A into B. Of course, there are many 
others we could describe; 64, to be exact. 

Example 6.1.2. Let A = {2, 3, 5, 6) and define a relation r from A into A by (a, b) e r if and only if a divides evenly into b. So 

r = {(2, 2), (3, 3), (5, 5), (6, 6), (2, 6), (3, 6)). 

Definition: Relation on a Set. A relation from a set A into itself is called a relation on A. 

The relation "divides" in Example 6.1 .2 is will appear throughout the book. Here is a general definition on the whole set of integers. 

Definition: Divides. Let a, b e Z. 

a | b if and only if there exists an integer k such that a k = b. 

Based on the equation a k = b,we can say that a \ b is equivalent to k = -, or a divides evenly into b. In fact the "divides" is short for "divides 

evenly into." You might find the equation k = - initially easier to understand, but in the long run we will find the equation ak = b more 

useful. 

Sometimes it is helpful to illustrate a relation. Consider Example 6.1 .1 . A picture of /' can be drawn as in Figure 6.1.1. The arrows indicate that 
1 is related to 4 under r. Also, 2 is related to 4 under r, and 3 is related to 5, while the upper arrow denotes that r is a relation from the whole set 
A into the set B . 

A typical element in a relation r is an ordered pair (x, y). In some cases, r can be described by actually listing the pairs which are in r, as in the 
previous examples. This may not be convenient if r is relatively large. Other notations are used depending on personal preference or past 
practice. Consider the following relations on the real numbers: 
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FIGURE 6.1 .1 Illustration of a relation 

r = {(x, y) | y is the square of x) , and s = {(x, y) | x < }>}. 

The notation (4, 16) e r or (3, 7.2) e s makes sense in both cases. However, r would be more naturally expressed as r(x) = x 2 or r(x) = y, 
where y = x 2 . But this notation when used for s is at best awkward. The notation x < y is clear and self-explanatory; it is a better notation to 
use than (x, y) e s. 

Many of the relations we will work with "resemble" the relation <, SO x sy is a common way to express the fact that x is related to y through the 
relation s. 

Relation Notion. Let s be a relation from a set A into a set B. Then the fact that (x, y) e s is frequently written xsy. 

Let A = {2, 3, 5, 8}, B = {4, 6, 16), and C = (1, 4, 5, 7}; let r be the relation "divides," denoted by |, from A into B; and let s be the 
relation < from B into C. So r = {(2, 4), (2, 6), (2, 16), (3, 6), (8, 16)} and s = {(4, 4), (4, 5), (4, 7), (6, 7)}. 

Notice from Figure 6.1 .2 that we can, for certain elements of A, go through elements in B to results in C. That is: 

2 | 4and4 < 4 
2 | 4and4 < 5 
2 | 4and4 < 7 

2 | 6and6 < 7 

3 | 6 and 6 < 7 

Based on this observation, we can define a new relation, call it rs, from A into C. In order for (a, c) to be in rs, it must be possible to travel 
along a path in Figure 6.1.2 from a to c. In other words, (a, c) e rs if and only if (3 b) B (arb and bsc). The name rs was chosen solely 
because it reminds us that this new relation was formed by the two previous relations r and s. The complete listing of all elements in rs is 
{(2, 4), (2, 5), (2, 7), (3, 7)). We summarize in a definition. 




FIGURE 6.1 .2 Illustration of relation "divides" 

Definition: Composition of Relations. Let r be a relation from a set A into a set B, and let s be a relation from B into a set C. The 
composition of r and s, written rs, is the set of pairs of the form (a, c) e AxC, where (a, c) £ rs if and only if there exists b £ B such that 
(a, b) erand(b, c) es. 

Remark: A word of warning to those readers familiar with composition of functions. (For those who are not, disregard this remark. It will be 
repeated at an appropriate place in Chapter 7.) As indicated above, the traditional way of describing a composition of two relations is rs where r 
is the first relation and s the second. However, function composition is traditionally expressed "backwards"; that is, as sr (or s°r), where r is the 
first function and s is the second. 

EXERCISES FOR SECTION 6.1 
A Exercises 

1 . For each of the following relations r defined on P, determine which of the given ordered pairs belong to r. 

(a) xry iff x\y; (2, 3), (2,4), (2, 8), (2, 17) 

(b) xryiffx < y; (2, 3), (3,2), (2,4), (5, 8) 

(c) xryiffy = x 2 ; (1,1), (2, 3), (2,4), (2, 6) 
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2. The following relations are on { 1 , 3 , 5} . Let r be the relation x r y iff y = x + 2 and s the relation x s y iff x < y. 
(a) Find rs. 

b) Find sr, 

(c) Illustrate rs and sr via a diagram. 

(d) Is the relation (set) rs equal to the relation srl Why? 

3. Let A = {1, 2, 3, 4 , 5} and define r on A by xry iff x + 1 = y. We 

define r 2 = rr and r 3 = r 2 r. Find: 
(a)r 

(b)r 2 
(c) r 3 

4. Given s and t, relations on Z, .s = {(1, n) : n e Z) and t = {(n, 1) : n e Z),what are .si and tt? 

B Exercises 

5 . Let p be the relation on the power set, P(S ), of a finite set S of cardinality n. Define p by (A, B) e p iff A f] B = 0, 

(a) Consider the specific case n = 3, and determine the cardinality of the set p. 

(b) What is the cardinality of p for an arbitrary nl Express your answer in terms of n. (Hint: There are three places that each element of S can 
go in building an element of p.) 

6. Let r\, r^, and r$ be relations on any set A. Prove that if r\ c ^then r\ r$ c r-i r^. 
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I 6.2 Graphs of Relations 



In this section we will give a brief explanation of procedures for graphing a relation. A graph is nothing more than an illustration that gives us, 
at a glance, a clearer idea of the situation under consideration. A road map indicates where we have been and how to proceed to reach our 
destination. A flow chart helps us to zero in on the procedures to be followed to code a problem and/or organize the flow of information. The 
graph of the function y = 2x + 3 in algebra helps us to understand how the function behaves. Indeed, it tells us that the graph of this function 
is a straight line. The pictures of relations in the previous section gave us an added insight into what a relation is. They indicated that there are 
several different ways of graphing relations . We will investigate two additional methods . 

Example 6.2.1, Let A = {0, 1, 2, 3), and let 

r = {(0, 0), (0, 3), (1, 2), (2, 1), (3, 2), (2, 0)}. 

The elements of A are called the vertices of the graph. They are represented by labeled points or occasionally by small circles. Connect vertex 
a to vertex b with an arrow, called an edge of the graph, going from vertex a to vertex b if and only \iarb. This type of graph of a relation r 
is called a directed graph or digraph . The result is Figure 6 .2 . 1 . Notice that since 1 r 2 and 2 r 1 , we draw a single edge between 1 and 2 with 
arrows in both directions. 




FIGURE 6.2.1 



The actual location of the vertices is immaterial. The main idea is to place the vertices in such a way that the graph is easy to read. Obviously, 
after a rough-draft graph of a relation, we may decide to relocate and/or order the vertices so that the final result will be neater. Figure 6.2.1 
could be presented as in Figure 6.2.2. 




FIGURE 6.2.2 

A vertex of a graph is also called a node, point, or a junction. An edge of a graph is also referred to as an arc, a line, or a branch. Do not be 
concerned if two graphs of a given relation look different. It is a nontrivial problem to determine if two graphs are graphs of the same relation. 

Example 6.2.2. Consider the relation s whose digraph is Figure 6.2.3. What information does this give us? Certainly we know that s is a 
relation of a set A, where A = {1, 2, 3) and 

* = {(1, 2), (2, 1), (1, 3), (3, 1), (2, 3), (3, 3)), 
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FIGURE 6.2.3 

Example 6.2.3. Let B = {a, b\, and let A = f(B) = {0, {a}, {b\, {a, b}\. Then c is a relation on A whose digraph is Figure 6.2.4. 




FIGURE 6.2.4 

This graph is helpful insofar as it reminds us that each set is a subset of itself (How?) and shows us at a glance the relationship between the 
various subsets in f (B). Some relations, such as this one, can also be conveniently depicted by what is called a Hasse, or ordering, diagram. To 
read a Hasse diagram for a relation on a set A, remember: 

(1) Each vertex of A must be related to itself, so the arrows from a vertex to itself are not necessary. 

(2) If vertex b appears above vertex a and if vertex a is connected to vertex b by an edge, then a r b, so direction arrows are not necessary. 

(3) If vertex c is above vertex a and if c is connected to a by a sequence of edges , then arc . 

(4) The vertices (or nodes) are denoted by points rather than by "circles." 

The Hasse diagram of the directed graph depicted in Figure 6.2.4 is Figure 6.2.5. 
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{1,2} 




FIGURE 6.2.5 
Example 6.2.4. Consider the relation s whose Hasse diagram is Figure 6.2.6. 




FIGURE 6.2.6 

How do we read this diagram? What is Al What is si What does the digraph of s look like? Certainly A = {1,2, 3, 4, 5) and 1 s2, 3 s4, 
1 s4, 1 s5, etc., so 

s = {(1, 1), (2, 2), (3, 3), (4, 4), (5, 5), (1, 3), (1,4), (1, 5), (1, 2), (3, 4), (3, 5), (4, 5), (2, 5)) 
A digraph for s is Figure 6.2.7. It is certainly more complicated to read than the Hasse diagram. 




FIGURE 6.2.7 Digraph of Example 6,2.4 
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EXERCISES FOR SECTION 6.2 
A Exercises 

1. LetA = {1, 2, 3, 4), and let r be the relation < on A. Draw a digraph and a Hasse diagram for r. 

2. LetS = {1,2, 3, 4, 6, 8, 12, 24), and let s be the relation "divides," on B. Draw a Hasse diagram for s. 

3. Draw a Hasse diagram of the relation c on P(A), where A = {a, b, c\. 

4. (a) Let A be the set of strings of 0's and l's of length 3 or less. Define the relation of d on A by xdy if x is contained within y. For example, 
01 d 101 . Draw a Hasse diagram for this relation. 

(b) Do the same for the relation p defined by x p y if x is a prefix of y. For example, 10 p 101 , but 01 p 101 is false. 

5. Draw the digraph for the relation p in Exercise 5 of Section 6.1, where S = {a, b). Explain why a Hasse diagram could not be used to depict 
P- 

6. LetC= {1,2, 3, 4, 6, 8, 12, 24} and define t on C by 

a t b if and only if a and b share a common divisor greater than 1 . 
Draw a digraph for t. 



Applied Discrete Structures by A. Doerr & K. Levasseur is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 



Chapter 6 - Relations 



I 6.3 Properties of Relations 



Consider the set B = { 1 , 2, 3, 4, 6, 12, 36, 48) and the relations "divides" and < on B. We notice that these two relations on B have several 
properties in common. In fact: 

(1) Every element in B divides itself and is less than or equal to itself. This is called the reflexive property. 

(2) If we search for two elements from B where the first divides the second and the second divides the first, then we are forced to choose the 
same first and second number. The reader can verify that a similar result is true for the relation < on B. This is called the antisymmetric 
property, 

(3) Next if we choose three numbers from B such that the first divides (or is <) the second and the second divides (or is <) the third, then this 
forces the first number to divide (or be <) the third. This is called the transitive property. 

Sets on which relations are defined which satisfy the above properties are of special interest to us. More detailed definitions follow. 
Definitions: Reflexive, Antisymmetric, and Transitive Relations. Let Abe a set and let r be a relation on A, then: 

(1) r is reflexive if and only if ar a for all a e A. 

(2) r is antisymmetric if and only if whenever arb and a £ b then bra is false; or equivalently whenever arb and bra then a = b. (The 
reader is encouraged to think about both conditions since they are frequently used.) 

(3) r is transitive if and only if whenever arb and brc then arc. 

A word of warning about antisymmetry: Students frequently find it difficult to understand this definition. Keep in mind that this term is defined 
through an "If . .. then . . ." statement. The question that you must ask is: Is it true that whenever there are elements a and b from A where arb 
and a =£ b, it follows that b is not related to al If so, then the relation r is antisymmetric. 

Another way to determine whether a relation is antisymmetric is to examine its graph. The relation is not antisymmetric if there exists a pair of 
vertices that are connected by edges in both directions . Note that the negation of antisymmetric is not symmetric We will define the symmetric 
property later. 

Definition: Partial Ordering. A relation on a set A that is reflexive, antisymmetric, and transitive is called a partial ordering on A. A set 
on which there is a partial ordering relation defined is called a partially ordered set or poset. 

Example 6.3.1. Let A be a set. Then V (A) together with the relation c is a poset. To prove this we observe that the three properties hold: 

(1) Let Bey (A), We must show that B c B. This is true by definition of subset. Hence, the relation is reflexive. 

(2) LetBj, B 2 eP (A) and assume that B t c B 2 and B { t B 2 . Could it be that B 2 c B t ? No. Why? Hence, the relation is anti symmetric. 

(3) Let Bi, B 2 ,B 3 eP(A) and assume that B ( c B 2 andB 2 c B 3 . Does it follow that Bi c B 3 ? Yes. Hence, the relation is transitive. 

Example 6.3.2. Consider the relation s defined by the Hasse diagram in Figure 6.2.6. A relation defined by a Hasse diagram is always a 
partial ordering. Let's convince ourselves of this. 

(1) First, j is reflexive? Yes, a Hasse diagram always implies that each element is related to itself. 

(2) Next, s is antisymmetric. From the diagram, can we find two different elements, say C[ and c 2 , such that c\ sc 2 and c 2 sc\l No. If 
d s c 2 , then c t and c 2 are connected by a series of edges in the Hasse diagram and ci is below c 2 . In order for c 2 s c^to be true, c 2 would 
need to be below c\ , which is impossible. 

(3) Finally, s is transitive. Again, this follows from the way Hasse diagrams are always interpreted. If C[ s c 2 , then c\ and c 2 are connected 
by a series of edges in the Hasse diagram and C\ is below c 2 . Similarly if c 2 s c 3 then c 2 and c 3 are connected by a series of edges in the 
Hasse diagram and c 2 is below c 3 . Thus, C]is below c 3 and we can patch together the two series of edges through c 2 to connect C\ to c 3 . 

Another property that is frequently referred to is that of symmetry. 

Definition: Symmetry. Let r be a relation on a set A. r is symmetric if and only if whenever arb, it follows that bra. 

Consider the relation of equality (=) defined on any set A. Certainly a = b implies that b = a so equality is a symmetric relation on A. 

Surprisingly, equality is also an antisymmetric relation on A. This is due to the fact that the condition that defines the antisymmetry property, 
a = b and a + b, is a contradiction. Remember, a conditional proposition is always true when the condition is false. So a relation can be both 
symmetric and antisymmetric on a set! Again recall that these terms are not negatives of one other. 

Definition: Equivalence Relation. A relation r on a set A is called an equivalence relation if and only if it is reflexive, symmetric, and 
transitive. 

The classic example of an equivalence relation is equality on a set A. In fact, the term equivalence relation is used because those relations which 
satisfy the definition behave quite like the equality relation. 

Example 6.3.3. Let Z* be the set of nonzero integers. One of the most basic equivalence relations in mathematics is the relation q on 
Z X Z* defined by (a, b)q(c, d) if and only if a d = be. We will leave it to the reader to, verify that q is indeed an equivalence relation. Be 
aware that since the elements of ZxZ'are ordered pairs, proving symmetry involves four numbers and transitivity involves six numbers. 
Two ordered pairs, (a, b) and (c, d), are related if the fractions - and - are numerically equal. 

b d 

Example 6.3.4. Consider the relation s described by the digraph in Figure 6.3.1. This relation is reflexive (Why?), not symmetric 
(Why?), and not transitive (Why?). Is s an equivalence relation? A partial ordering? It is neither, and among the valid reasons of stating this 
is that s isn't transitive. 
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FIGURE 6.3.1. 

A classic example of a partial ordering relation is < on the real numbers, R. Indeed, when graphing partial ordering relations, it is natural to 
"plot" the elements from the given poset starting with the "least" element to the "greatest" and to use terms like "least," "greatest," etc. Because 
of this the reader should be forewarned that many texts use the < notation when describing an arbitrary partial ordering. This can be quite 
confusing for the novice, so we continue to use the general notation r, s, etc., when speaking of relations. 

EXERCISES FOR SECTION 6.3 
A Exercises 

1 . (a) Let B = {a, b) and U = V(B). Draw a Hasse diagram for c on U. 

(b) LetA = {1,2, 3, 6). Show that divides, I, is a partial ordering on A. 

(c) Draw a Hasse diagram for divides on A. 

(d) Compare the graphs of parts a and c. 

2. Repeat Exercise 1 with B = {a, b, c) and A = {1,2, 3,5,6, 10, 15, 30}. 

3. (a) Consider the relations defined by the digraphs in Figure 6.3.2. Determine whether the given relations are reflexive, symmetric, antisym- 
metric, or transitive. Try to develop procedures for determining the validity of these properties from the graphs, 

(b) Which of the graphs in Figure 6.3.2 are of equivalence relations or of partial orderings? 
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(i) 



(ii) 





(iii) 




(iv) 



© 



® 



® 



(vi) 



(V) 





(vii) 




Figure 6.3.2 

4. Determine which of the following are equivalence relations and/or partial ordering relations for the given sets: 

(a) A = {lines in the plane); xr y if and only if x is parallel to v. 

(b) A = R; x r y if and only if | x — y \ < 7. 

5. Consider the following relation on { 1 , 2, 3, 4, 5, 6). r = {(i,j): \i - j\ = 2). 

(a) Is r reflexive? 

(b) Is r symmetric? 

(c) Is r transitive? 
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(d) Draw a graph of r. 

6. For the set of cities on a map, consider the relation xry if and only if city x is connected by a road to city y. A city is considered to be 
connected to itself, and two cities are connected even though there are cities on the road between them. Is this an equivalence relation or a 
partial ordering? Explain. 

7. Let A = (0, 1, 2, 3) and let 

r = {(0, 0), (1, 1), (2, 2), (3, 3), (1, 2), (2, 1), (3, 2), (2, 3), (3, 1), (1, 3)). 

(a) Show that r is an equivalence relation on A. 

(b) Let a e A and define c(a) = {b e A \ arb). c(a) is called the equivalence class of a under r. Find c (a) for each element a e A. 
(c) Show that {c (a) \ a <= A) forms a partition of A for this set A. 

(d) Let r be an equivalence relation on an arbitrary set A. Prove that the set of all equivalence classes under r constitutes a partition of A. 

8. Define r on the power set of { 1 , 2, 3)byArfi <=> | A | = \ B \ . Prove that r is an equivalence relation. What are the equivalence classes 
under rl 

9. Consider the following relations on Zg = {0, 1, . . . , 7). Which are equivalence relations? For the equivalence relations, list the equiva- 
lence classes. 

(a) arb iff the English spellings of a and b begin with the same letter. 

(b) a s b iff a - b is a positive integer. 

(c) a t b iff a - b is an even integer. 

10. Define I on A = {1, 2, . . . , 9) by xty iff x + y = 10. Is t an equivalence relation on A? If yes, list its equivalence classes. If no, why 
not? 

B Exercises 

11. In this exercise, we prove that implication is a partial ordering. Let A be any set of propositions . 

(a) Verify that q -» q is a tautology, thereby showing that => is a reflexive relation on A. 

(b) Prove that => is antisymmetric on A. Note: we do not use = when speaking of propositions , but rather equivalence, «=>. 

(c) Prove that => is transitive on A. 

(d) Given that q, is the proposition n < i on N, draw the Hasse diagram for the relation => on {qi, q2, <j3, ...). 

C Exercise 

12. Let S = {1, 2, 3, 4, 5, 6, 7) be a poset (5, <) with the Hasse diagram shown in Figure 6.3.3. Another relation r c 5x5 is defined as 
follows: (x, y) e r if and only if there exists z e 5 such that z < xnndz < y in the poset (5, <). 

(a) Prove that r is reflexive. 

(b) Prove that r is symmetric . 

(c) A compatible with respect to relation r is any subset Q of set 5 such that x e Q and y e Q => (x, y) <= r. A compatible g is a maximal 
compatible if Q is not a proper subset of another compatible. Give all maximal compatibles with respect to relation r defined above. 

(d) Discuss a characterization of the set of maximal compatibles for relation r when (5, <) is a general finite poset. What conditions, if any, 
on a general finite poset (5, <) will make r an equivalence relation? 
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I 6.4 Matrices of Relations 



We have discussed two of the many possible ways of representing a relation, namely as a digraph or as a set of ordered pairs. In this section we 
will discuss the representation of relations by matrices and some of its applications. 

Definition: Adjacency Matrix. Let A = {a h a 2 ,..., a m ) and B = {b h b 2 ,..., b n ) be finite sets of cardinality m and n, respectively . Let r 
be a relation from A into B. Then r can be represented by the mxn matrix R defined by 

1 if a t r bj 

*■ otherwise 

R is called the adjacency matrix (or the Boolean matrix, or the relation matrix) ofr. 



R„ 



Example 6.4.1. Let A = (2, 5, 6) and let r be the relation {(2, 2), (2, 5), (5, 6), (6, 6)) on A. Since r is a relation from A into the 
same set A (the B of the definition), we have a\ = 2, a 2 = 5, and 03 = 6, and b[ = 2,b 2 = 5, and b^ = 6. Next, since 



2 r 2, we have fin = 1; 




2r5, we have R[ 2 = 1; 


5r6, wehave/?23 = l;and 


6r6, we have /?33 = 1; 


All other entries of R are 0, so 




(1 1 -J 




R = 


1 
-001, 




From the del 


inition of r 


and of composition, we note that 



r 2 = {(2, 2) (2, 5) (2, 6) (5, 6) (6, 6)}, 
The adjacency matrix of r 2 is 
(1 1 1 \ 



R 2 



1 
10 1 ) 



We do not write R 2 only for notational purposes. In fact, R 2 can be obtained from the matrix product RR; however, we must use a slightly 
different form of arithmetic . 

Definition: Boolean Arithmetic. Boolean arithmetic is the arithmetic defined on {0,1} using Boolean addition and Boolean multiplica- 
tion, defined as: 

+ = 0+1 = 1+0=1 1 + 1 = 1 

0-0=0 0-1 = 1-0 = 1-1 = 1. 

Notice that from Chapter 3, this is the "arithmetic of logic," where + replaces "or" and • replaces "and." 
Example 6.4.2. 



If R-- 



(0 1 
10 10 
10 1 

{0 1 ) 



and S 



Then using Boolean arithmetic, if S 



(0 1 1 1 

11 

1 

1,0 

{0 1 l\ 

111 
11 

10 1 ) 



and SR- 



(\ 1 1 1 

111 

10 

{0 ) 



Theorem 6.4.1. Let Aj , A 2 , and A 3 be finite sets where r t is a relation from Ai into A 2 and r 2 is a relation from A 2 into Af.IfRi and R 2 
are the adjacency matrices of r t and r 2 , respectively, then the product Rj R 2 using Boolean arithmetic is the adjacency matrix of the 
composition r y r 2 . 

Remark: A convenient help in constructing the adjacency matrix of a relation from a set A into a set B is to write the elements from A in a 
column preceding the first column of the adjacency matrix, and the elements of B in a row above the first row. Initially, R in Example 6.4.1 
would be 



2 5 6 
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and Rfj is 1 if and only if (a,-, bj] e r. So that, since the pair (2, 5) e r, the entry oiR corresponding to the row labeled 2 and the column labeled 

5 in the matrix is a 1 . 

Example 6.4.3, This final example gives an insight into how relational data base programs can systematically answer questions pertain- 
ing to large masses of information. Matrices R (on the left) and S (on the right) define the relations r and s where arbii software a can be run 
with operating system b, and b s c if operating system b can run on computer c. 



OS1 OS2 OS3 OS4 



PI 
P2 
P3 

P4 



< 1 

1 1 



I 



o ^ 


1 
1 





CI 


C2 


C3 


OS1 


( 1 


1 


\ 


OS2 





1 





OS3 








1 


OS4 


I o 


1 


1 J 



Although the relation between the software and computers is not implicit from the data given, we can easily compute this information. The 
matrix of rs is RS, which is 



CI C2 C3 



PI 
P2 
P3 

P4 



( 1 
1 

1 
I 



1 \ 


1 
1 



This matrix tells us at a glance which software will run on the computers listed. In this case, all software will run on all computers with the 
exception of program P2, which will not run on the computer C3, and program P4, which will not run on the computer CI. 



EXERCISES FOR SECTION 6.4 
A Exercises 

1. Let A, = {1, 2, 3, 4), A 2 = {4, 5, 6), and A 3 = {6, 7, 8). Let r x be the relation from A x into A 2 defined by r x = {(x, y) \ y - x = 2), 
and let r 2 be the relation from A 2 into A 3 defined by r 2 = {(x, y)\ y — x = 1}. 

(a) Determine the adjacency matrices of r\ and r 2 . 

(b) Use the definition of composition to find r\ r 2 . 

(c) Verify the result in part by finding the product of the adjacency matrices of r\ and r 2 . 

2. (a) Determine the adjacency matrix of each relation given via the digraphs in Exercise 3 of Section 6.3. 

(b) Using the matrices found in part (a) above, find r 2 of each relation in Exercise 3 of Section 6.3. 

(c) Find the digraph of r 2 directly from the given digraph and compare your results with those of part (b) . 

3. Suppose that the matrices in Example 6.4.2 are relations on {1, 2, 3, 4). What relations do R and S describe? 

4. Let D be the set of weekdays, Monday through Friday, let W be a set of employees {1, 2, 3) of a tutoring center, and let V be a set of 
computer languages for which tutoring is offered, {A(PL), B(asic), C(++), J(ava), L(isp), P(ython)}. We define s (schedule) from D into Wby 
d s w if w is scheduled to work on day d. We also define r from W into V by w r I if w can tutor students in language /. If s and r are defined by 
matrices 



M 



1 2 3 
(I 1 



T 

W 
Th 
F 



and R-- 



A B C J L P 

1 1 1 ^ 

110 10 1 

10 11, 



1 1 

1 1 
1 

U 1 j 

(a) compute S R using Boolean arithmetic and give an interpretation of the relation it defines, and 

(b) compute S R using regular arithmetic and give an interpretation of the result describes . 

5. How many different reflexive, symmetric relations are there on a set with three elements? (Hint: Consider the possible matrices.) 

6. Let A = {a, b, c, d). Let r be the relation on A with adjacency matrix 





a 


b 


c 


d 


a 


(\ 








^ 


b 





1 








c 


1 


1 


1 





c 


^0 


1 





1 J 
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(a) Explain why r is a partial ordering on A. 

(b) Draw its Hasse diagram. 

7. Define relations p and q on { 1 , 2, 3, 4) by p = {(a, b) : \ a-b\ =l)and q = {(a, b) | a-Hseven) 

(a) Represent p and q as both graphs and matrices. 

(b) Determine p q, p 2 , and q 2 ; and represent them clearly in any way. 

B Exercises 

8. (a) Prove that if r is a transitive relation on a set A, then r 2 c r. 
(b) Find an example of a transitive relation for which r 2 + r. 

9. We define < on the set of all nxn relation matrices by the rule that if R and S are any two nxn relation matrices, R < S if and only if 

Ru < S;; for all 1 < i, j < n. 

ij ij -i 

(a) Prove that < is a partial ordering on all n x n relation matrices . 

(b) Prove that R < S => R 2 < S 2 , but the converse is not true. 

(c) If R and S are matrices of equivalence relations and R < S, how are the equivalence classes defined by R related to the equivalence classes 
defined by 5? 
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I 6.5 Closure Operations on Relations 



In Section 6.1, we studied relations and one important operation on relations, namely composition. This operation enables us to generate new 
relations from previously known relations. In Section 6.3, we discussed some key properties of relations. We now wish to consider the situation 
of constructing a new relation r + from a previously known relation r where, first, r + contains r and, second, r + satisfies the transitive property. 

Consider a telephone network in which the main office a is connected to, and can communicate to, individuals b and c. Both b and c can 
communicate to another person, d; however, the main office cannot communicate with d. Assume communication is only one way, as indicated. 
This situation can be described by the relation r = {(a, b), (a, c), (b, d), (c, d)}. We would like to change the system so that the main office 
a can communicate with person d and still maintain the previous system. We, of course, want the most economical system. 

This can be rephrased as follows; Find the smallest relation r + which contains r as a subset and which is transitive; 
r + = {(a, b), (a, c), (b, d), (c, d), (a, d)). 

Definition: Transitive Closure. Let A be a set and r be a relation on A. The transitive closure of r, denoted by r + , is the smallest 
transitive relation that contains r as a subset. 

Example 6.5.1. Let A = {1,2, 3,4} , and let S = {(1, 2), (2, 3), (3, 4)) be a relation on A. This relation is called the successor 
relation on A since each element is related to its successor. How do we compute S + ? By inspection we note that (1, 3) must be in S + . Let's 
analyze why. This is so because (1, 2) e S and (2, 3) e S, and the transitive property forces (1, 3) to be in S + . 

In general, it follows that if (a, b) eS and (b, c) e S, then (a, c) eS + . This condition is exactly the membership requirement for the pair 
(a, c) to be in the composition SS = S 2 . So every element in S 2 must be an element in S + . So far, <S + contains at least S U S 2 . In particu- 
lar, for this example, since S = {(1, 2), (2, 3), (3, 4))andS 2 = {(1, 3), (2, 4)), we have 

S(J S 2 ={(1, 2), (2, 3), (3, 4),(1, 3), (2, 4)). 

Is the relation S [J S 2 transitive? Again, by inspection, (1, 4) is not an element of S [J S 2 , but it must be an element of S + since (1,3) and 
(3, 4) are required to be in S + . From above, (1, 3) eS 2 and (3, 4) e S, and the composite S 2 S = S 3 produces (1, 4). This shows that 
<S 3 c S + . This process must be continued until the resulting relation is transitive. If A is finite, as is true in this example, the transitive 
closure will be obtained in a finite number of steps. For this example, 

S + =S(JS 2 (J 5 3 = {(1, 2), (2, 3), (3, 4), (1, 3), (2, 4), (1,4)}. 

Theorem 6.5.1. If r is a relation on a set A and \A\ = n, then the transitive closure ofr is the union of the first n powers ofr. That is, 

r* =r[j r 2 \J r 3 (J- U r». 
Let's now consider the matrix analogue of the transitive closure. 

Example 6.5.2. Consider the relation 

r = {(1, 4), (2, 1), (2, 2), (2, 3), (3, 2), (4, 3), (4, 5), (5, 1)) 

on the set A = {1,2, 3, 4, 5). The matrix of r is 

10^ 
1110 
A' = 1 
10 1 

U o o o o ) 



. can be determined through computing the matrix powers R 2 , R 3 , 



Here, 



R 2 = 



10 1 
11110 
1110 
110 
10 

11110 

11111 
11111 
11111 

110 



/? 3 = 



110 



and 



1 


1 


1 


1 


1 


1 


1 





1 


1 


I 1 





vO 





I 


1 J 


<\ 


1 


1 


1 1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 1 


1 


vl 


1 


1 1 


J 



How do we relate IJ r' to the powers of /?? 



Theorem 6.5.2. Let r be a relation on a finite set and let R + be the matrix of r + , the transitive closure of r. 
R + = R + R 2 + ■■■ + R", using Boolean arithmetic . 
Using this theorem, we find R + is the 5x5 matrix consisting of all 1 ' s, thus, r + is all of AxA. 



Then 
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WARSHALL'S ALGORITHM 

Let r be a relation on the set { 1 , 2, ... , n) with relation matrix R. The matrix of the transitive closure R + , can be computed by the equation 
R + = R + R + ■■■ + R" . By using ordinary polynomial evaluation methods, you can compute R + with n - 1 matrix multiplications: 

R + = R(I + R(I +(■■■«(/+ R) •■•))). 
For example, if n = 3,R = R(I + R(I + R)). 

We can make use of the fact that if T is a relation matrix, T + T = T due to the fact that 1 + 1 = 1 in Boolean arithmetic. Let 
S k = R + R 2 + ■■■ + R k .Then 
R = Si 

S [ (I + S i ) = R(I + R) = R + R 2 = S 2 

S 2 (I + S 2 ) = (R + R 2 )(I + R + R 2 ) 

= (R + R 2 ) + (R 2 + R 3 ) + (R 3 + R 4 ) 
= R + R 2 + R 3 + R A = S 4 
Similarly, 

S4I + S 4 ) = Sg 

etc.. 

Notice how each matrix multiplication doubles the number of terms that have been added to the sum that you currently have computed. In 
algorithmic form, we can compute R 2 as follows. 

Algorithm 6.5.1: Transitive Closure Algorithm 1. Let R be a known relation matrix and let R + be its transitive closure matrix, which is to be 
computed. 

l.Q.T:=R 
2.0. Repeat 

2.1 S:= T 

2.2 T:= S(I + S) // using Boolean arithmetic 
Until T = S 

3.0. Terminate with T = R + . 

Notes: 

(a) Often the higher-powered terms in S n do not contribute anything to R + . When the condition T = S becomes true in Step 2, this is an 
indication that no higher-powered terms are needed. 

(b) To compute R + using this algorithm, you need to perform no more than Pog2 "1 matrix multiplications, where fx] is the least integer that is 
greater than or equal to x. For example, if r is a relation on 25 elements, no more than \\og 2 25] = 5 matrix multiplications are needed. 

A second algorithm, Warshall's Algorithm, reduces computation time to the time that it takes to perform one matrix multiplication. 

Algorithm 6.5.2; Warshall's Algorithm. Let R be a known relation matrix and let R + be its transitive closure matrix, which is to be computed. 

1.0 T:= R 

2.0 FOR k: = lion DO 

FOR i : = 1 to n DO 
FORj := Hon DO 

T[i,j]:=T[i,j] + T[i,k]-T[k,j] 
3.0 Terminate with T = R + . 

EXERCISES FOR SECTION 6.5 
A Exercises 

1. Let A and S be as in Example 6.5.1. Compute S + as in Example 6.5.2. Verify your results by checking against the relation S + obtained in 
Example 6.5.1. 
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2 . Let A and r be as in Example 6 .5 .2 . Compute the relation r + as in Example 6 .5 . 1 . Verify your results . 

3. (a) Draw digraphs of the relations S, S 2 ,S 3 , and «S + of Example 6.5.1 . 

(b) Verify that in terms of the graph of S, a S + b if and only if b is reachable from a along a path of any finite nonzero length. 

4. Let r be the relation represented by the digraph in Figure 6.5 ,1 . 

(a) Find r + . 

(b) Determine the digraph of r + directly from the digraph of r. 

(c) Verify your result in part (b) by computing the digraph from your result in part (a). 




FIGURE 6.5.1 

5. (a) Define reflexive closure and symmetric closure by imitating the definition of transitive closure. 

(b) Use your definitions to compute the reflexive and symmetric closures of Examples 6.5.1 and 6.5.2. 

(c) What are the transitive reflexive closures of these examples? 

(d) Convince yourself that the reflexive closure of the relation < on the set of positive integers P is < . 

6. What common relations on Z are the transitive closures of the following relations? 

(a) a S b if and only if a + 1 = b. 

(b) a Rfo if and only if \a - b\ =2. 

B Exercise 

7. (a) Let Abe any set and r a relation on A, prove that (r + ) + =r + . 

(b) Is the transitive closure of a symmetric relation always both symmetric and reflexive? Explain. 
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I SUPPLEMENTARY EXERCISES FOR CHAPTER 6 

Section 6.1 

1 . Give an example to illustrate how the relation "is a grandparent of" is a composition of the relation "is a parent of" on people. 

2. Three students, Melissa, John, and Ted, would like to set up a tutorial program in the languages Pascal, FORTRAN, and COBOL. Melissa is 
proficient in all three languages, John in Pascal and FORTRAN, and Ted in just FORTRAN. 

(a) Let S = {three students), L = {three Languages), and let p be the relation "is proficient in the language of" . Describe this relation as 
a set of ordered pairs and illustrate the relation by a diagram similar to that of Figure 6.1.1. 

(b) Two P.C.s are available for tutoring purposes; one has software for Pascal and FORTRAN, and the second only for Pascal. Describe 
by a composite relation which student can tutor on each machine. Illustrate this composite relationship. 

Section 6.2 

3. Let A = {-1, 0, 1, 2). List the ordered pairs and draw the digraphs of each of the following relations on A. 

(a) r = {(x,y)\y = x+ 1) 

(b) s = {(x,y) \x 2 =y 2 } 

(c) t = {(x, y)\x*y) 

4. List the ordered pairs and draw the digraph of the relation s 2 for the relation s of Exercise 2, Section 6.2. 

5. In Figure 6.2, 1, assume the nodes stand for four separate cities where a manufacturer has warehouses, while the arrows represent one-way 
streets . Where should the manufacturer place his main office? Where is the least desirable location? How can we interpret the arrows in both 
directions between nodes 1 and 2? 

6. The problem of computer compatibility is an important one. In Figure 6.2.2 interpret the four nodes as representing computers, and an arrow 
from one node to another as "is compatible with" . Note that some software does not go both ways. 

(a) Is there any one computer that is not compatible with any other? 

(b) Is it possible to create a network where any computer could be linked with any other using at most two links? If not, what soft ware 
should be created to enhance compatibility? 

(c) If an arrow from a node to itself is interpreted as "high flexibility" of the system, does this affect your answer in part b? 

Section 6.3 

7. In Figure 6.3.2 (vii), interpret the four nodes as representing people, and an arrow from one node to another as "being friendly toward". Note 
that some friendships are not mutual. 

(a) Is there any individual in this group unfriendly to everyone else? 

(b) If this group were a committee, who is most likely to be the chairperson; that is, who is friendly toward the most people? 

(c) If an arrow from one vertex to itself is interpreted as "great person ality ," does your answer to part b still hold? 

(d) The four people are to be seated at a round table. A person is to be seated between two people only if he is friendly toward both of 
them. Does a seating arrangement exist? Is there more than one? 

8. Let A = {a, b, c, d, e) and let r, s, and t be the following relations on A: 

r = {(a, a), (a, b), (b, b), (b, c), (c, c), (c, d), (d, d), (d, e), (e, e), (e, a)} 

s = {(a, a), (a, b), (a, d), (b, a), (b, b), (b, d), (c, c), (d, a), (d, b), (d, d), (e, e)) 

t = {(a, a), (a, b), (b, b), (c, b), (c, c), (d, d), (e, a), (e, b), (e, c), (e, d), (e, e)) 

(a) Which relation is a partial ordering? Draw its Hasse diagram. 

(b) Which relation is an equivalence relation? List its equivalence classes. 

9. Demonstrate that the relation "living in the same house" on the set of people in a given city is an equivalence relation. State the necessary 
assumption for this to be the case. 

10. Let A = {00, 01, 10, 11), the set of strings of 0s and Is with length two. Given r and s defined by 

xry <=> xandy differ in exactly one position (for example 01 r 11, but not 10r01), and 
xsy <=> x and y have the same number of 0s . 

(a) Draw a directed graph of r. 

(b) Which of the adjectives, reflexive, symmetric, antisymmetric, and transitive, describe r? Explain your answers. 

(c) Which of the adjectives, reflexive, symmetric, antisymmetric, and transitive, describe si 

(d) Describe with a directed graph the relation rs. 

1 1 . Determine whether the following relations are partial orderings and/or equivalence relations on the given set: 

(a) C = {students in this class); xry iff x and y have the same grade point average. 

(b) C = {students in this class); xsy iff x is taller than y, 

(c) Rephrase (slightly) the relation in part b so it is a partial ordering relation. 
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12. Let A ■■ 



b, c, d). Draw the graph of a relation where the relation is: 



(a) reflexive, symmetric, but not transitive. 

(b) transitive, but not symmetric and not reflexive. 

(c) both an equivalence relation and a partial ordering. 

Section 6.4 

13. How many symmetric relations can there be on a four-element set? Hint: Think of the possible relation matrices. 

14. Let A = {1, 2, 3, 4, 5, 6) and let p = {(;', j) \ i divides j) be a relation on A. 

(a) List the elements in p. 

(b) Determine the relation matrix of p. 

(c) Construct the digraph and the Hasse diagram of p. 

15. Let A = (a, b, c). The following matrices describe relations on A: 



(i) 

(a) 
(b) 

(c) 



110 
110 
1 



(11) 



' 1 



1 



Draw the graph of the relation. 

Describe each relation as a set of ordered pairs . 

Compute r 2 for each relation r. 



Section 6.5 

16. Let the relation s on the set {a, b, c, d, e) be given by the matrix 





a 


b 


c 


d 


e 


a 


'0 











\ 


b 


1 





1 








c 





1 











d 








1 








e 


v 1 


1 








J 



17. 



(a) Draw the digraph of s. 

(b) Find the transitive closure of s. Give the adjacency matrix or the digraph or the set of ordered pairs. 

Consider the relation r on {1, 2, 3, 4} whose Boolean matrix is 





1 


2 


3 


4 


1 


ro 





1 


1 ^ 


2 





1 








3 








1 





4 


\\ 








J 



(a) Draw the graph of r. 

(b) Determine whether r is reflexive, symmetric, antisymmetric, and/or transitive. Explain fully. 

(c) Find the transitive closure of r and draw the graph of r + . 

18. In a small town a bank (b), school (s), town hall (f), and shopping mall (m) are connected by a series of narrow one-way streets; a street 
from the town hall to the bank, one from the bank to the school, one from the school to the shopping mall, and one from the shopping mall to 
the town hall. 

(a) Draw a digraph of this system of roads. 

(b) Find the matrix representation of the digraph in part a. 

(c) Assuming that the given streets cannot be widened, assist the mayor in planning the construction of new roads to increase traffic 
flow. Assume that if there is a one-way street from point a to b and one from point b to c, there should be one from point a toe. 

(d) If you have not done so yet, draw the matrix representation and the graph of your answer to part c and interpret the results for the 
mayor. 

19. The ambassadors of four countries are to meet with the ambassador of the United States (Aj) to discuss world problems. Some countries are 
friendly to each other, some are not, and in certain situations the friendship is one-way. The U.S. ambassador's daughter (who obviously took a 
discrete structures course) assists her father in diagnosing this complex situation and, using the relation "is friendly toward," has come up with 
the following digraph. 
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(a) Is there any person friendly to no one? 

(b) Who should be the chairman of this committee; that is, who is friendly to most people? 

(c) The U.S. ambassador would like to graph all friendships that can be developed through intermediaries on the committee. That is, if 
person a is friendly toward person b and person b is friendly toward person c, then a can communicate to c through b. Draw this digraph. Can 
the U.S. ambassador communicate to every person on the committee through some person(s)? If not, what friendships should he develop to do 
so? 
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chapter 7 




FUNCTIONS 



GOALS 

In this chapter we will consider some basic concepts of the relations that are called functions. A large variety of mathematical ideas and 
applications can be more completely understood when expressed through the function concept. 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3 .0 United States 



124 124 

Chapter 7 - Functions 



I 7.1 Definition of a Function and Notation 



Definition: Function. A function from a set A into a set B is a relation from A into B such that each element of A is related to exactly one 
element of the set B. The set A is called the domain of the function and the set B is called the codomain. 

The reader should note that a function /is a relation from A into B with two important restrictions: 

1 . Each element in the set A, the domain of/, must be related to some element of B, the codomain. 

2. The phrase "is related to exactly one element of the set B" means that if (a, b) e / and (a, c) e /, then b = c. 

Example 7.1.1. Let A = {-2, -1,0, 1, 2) and B = {0, 1, 2, 3, 4), and if s = ((-2, 4), (-1, 1), (0, 0), (1, 1), (2, 4)), then s is a 
function from A into B. 

Example 7.1.2. Let IR be the real numbers. Then L = {(x, 3 x) \ x E R) is a function from R into IR, or, more simply, L is a function on 
IR. 

We will use a different system of notation for functions than the one we used for relations. If /is a function from the set A into the set B, we will 
write / ' : A -> B. 

The reader is probably more familiar with the notation for describing functions that is used in basic algebra or calculus courses. For example, 
y = - or f(x) = - both define the function Ux, -J leR, x t 0). Here the domain was assumed to be those elements of R whose substitu- 
tions for x make sense, the nonzero real numbers, and the codomain was assumed to be IR. In most cases, we will make a point of listing the 
domain and codomain in addition to describing what the function does in order to define a function. 

The terms mapping, map, and transformation are also used for functions. 

One way to imagine a function and what it does is to think of it as a machine. The machine could be mechanical, electronic, hydraulic, or 
abstract. Imagine that the machine only accepts certain objects as raw materials or input. The possible raw materials make up the domain. Given 
some input, the machine produces a finished product that depends on the input. The possible finished products that we imagine could come out 
of this process make up the codomain. 

Example 7.13. f : R -» R defined by / (x) = x 2 is an alternate description of / = {(x, x 2 )\x e R). 

Definition: Image of an Element. Let f : A -> B, (read "let f be a function from the set A into the set B"). If a e A, then f (a) is used to 
denote that element of B to which a is related, f (a) is called the image of a, or, more precisely, the image of a under f. We write f (a) = b to 
indicate that the image of a is b. 

In Example 7.1.3, the image of 2 under/is 4; that is, /(2) = 4. In Example 7.1.1, the image of -1 under s is 1; that is, s(-\) = 1. 

Definition: Range of a Function. The range of a function is the set of images of its domain. If f:X -» Y, then the range of f is denoted 
f(X), and 

f(X) ={f(a)\a eX\ 

= \b eY\ 3 a eX such that f (a) = b) 

Note that the range of a function is a subset of its codomain. / (X) is also read as "the image of the set X under the function/ 1 or simply "the 
image of/." 

In Example 7.1.1, s (A) = {0, 1, 4). Notice that 2 and 3 are not images of any element of A. In addition, note that both 1 and 4 are related to 
more than one element of the domain s(\) = s(-\) = 1 and s(2) = s(-2) = 4. This does not violates the definition of a function. Go back 
and read the definition if this isn't clear to you. 

In Example 7.1.2, the range of L is equal to its codomain, IR. If b is any real number, we can demonstrate that it belongs to L(R) by finding a 
real number x for which L(x) = b. By the definition of L, L(x) = 3x, which leads us to the equation 3 x = b. This equation always has a 

solution, -; thus L(R) = R. 

The formula that we used to describe image of a real number under L, L(x) = 3x, is preferred over the set notation for L due to its brevity. 
Any time a function can be described with a rule or formula, we will use this form of description. In Example 7.1.1, the image of each element 
of A is its square. To describe that fact, we write s(a) = a 2 (a e A),orS:A -» B defined by S (a) = a 2 . 
There are many ways that a function can be described. The complexity of the function often dictates its representation. 

Example 7.1.4. Suppose a survey of 1,000 persons is done asking how many hours of television each watches per day. Consider the 
function W : {0, 1, ... , 24} -> (0, 1,2, ..., 1000) defined by 

W(t) = number of persons who gave a response of t hours . 

This function will probably have no formula such as the ones for s and L above. Besides listing the data in a table, a bar graph might be a 
good way to represent W. 

Example 7.1.5. Consider the function m : P -» Q defined by the set 
m = {(1, 1), (2, 1/2), (3, 9), (4, 1/4), (5, 25), ...). 
No simple single formula could describe m, but if we assume that the pattern given continues, we can write 
x 2 if x is odd 



m(x) 

\jx if x is even 
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FUNCTIONS OF TWO VARIABLES 

If the domain of a function is the Cartesian product of two sets, then our notation and terminology is changed slightly. For example, consider 
the function C : fsl xf^ -> t^l defined by C((»i, 112)) = n\ + n\ — n\ n^ + 10. For this function, we would drop one set of parentheses and write 
C (4, 2) = 22, not C ((4, 2)) = 22. We call C a function of two variables. From one point of view, this function is no different from any others 
that we have seen. The elements of the domain happen to be slightly more complicated. On the other hand, we can look at the individual 
components of the ordered pairs as being separate. If we interpret C as giving us the cost of producing quantities of two products, we can 
imagine varying t\\ while n^ is fixed, or vice versa. 

Mathematics Note 

There are several ways to define a function in Mathematica. One way is using a Function expression. For example the function / : Z -» Z, 
where /(x) = x can be defined by evaluating the expression 

f = Function [x, x 2 ] 

x \—> x 2 

Then we can compute the image of various numbers . Since Mathematica isn't a typed programming language the concept of a domain doesn't 
apply unless you specify a bit more structure. As it stands we can take the image of numbers, strings, matrices, or symbols, among other things: 



{f [-5], f [2.5], f ["Hello"], f [( * ^ J], f [6]} 

J25, 6.25, Hello 2 , [ 4 ], 2 ] 



Notice the syntax of Mathematica is to use square brackets such as f [x] instead of parentheses, f (x) for computing images under/. Notice 
that the square of the matrix isn't the matrix product, rather it is simpler componentwise product. 

If we want to restrict the possible inputs to a function that implements/, we might use the following alternate way of defining a function. 

fa[x_?IntegerQ] : = x 2 

If the "?IntegerQ" had been left out, fa and f would be identical, but as it is defined, the formula for fa only applies to integers. This 
makes fa more similar to/ We get what we expect when we ask for the image of an integer: 

{fa[-5], fa[25!]} 

(25,240597637008 332048087 335626 345604448 256000000000000) 
If we ask for the image of anything that isn't an integer, the expression is left unevaluated. 

|fa[2.5], fa ["Hello"], fa[( 3 _ j J], fa [6]} 

{fa[2.5], fa [Hello], fa[{{l, 2}, {3, -1}}], fa [9]} 
It is also possible to program Mathematica to issue an error message for these unintended cases. 

pS| Sage Note 

There are several ways to define a function in Sage. The simplest way to implement /is as follows. 

sage: f(x)=x"2 
sage: f 
x I — > x"2 

sage : f (4) 
16 

sage : f (-5.1) 
26.0100000000000 

Sage is built upon the programming language Python, which is a strongly typed language and so you can't evaluate expressions such as 
f ( "Hello" ) . However a function like f above will accept any type of number, so a bit more work is needed to restrict the inputs of f to the 
integers . 

A second way to define a function in Sage is based on Python syntax, 
sage: def fa(x): return x"2 



sage: fa(-4) 
16 
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sage: fa(5.1) 
26.0100000000000 

We close this section with two examples of relations that are not functions. 

Example 7.1.6. Let A = B = {1, 2, 3} and let / = {(1, 2), (2, 3)). Here / is not a function since / does not act on, or "use," all 
elements of A. 

Example 7.1.7. Let A = B = {1, 2, 3) and let g = {(1, 2), (2, 3), (2, 1), (3, 2)}. We note that g acts on all of A. However, g is still 
not a function since (2, 3) e g and (2, 1) e j and the condition on each domain being related to exactly one element of the codomain is 
violated. 

EXERCISES FOR SECTION 7.1 
A Exercises 

1. Let A = {1, 2, 3, 4)andB = [a,b, c, d). Determine which of the 
following are functions. Explain. 

(a) / c A x B, where/ = {(1, a), (2, b), (3, c), (4, d)). 

(b) gQAxB, where g = {(1, a), (2, a), (3,b), (A,d)). 

(c) h c Ax B, where A = {(1, a), (2, fo), (3, c)). 

(d) £ cAxB, where k = {(1, a), (2, b), (2, c), (3, a), (4, a)). 

(e) LcAxA, where L = {(1, 1), (2, 1), (3, 1), (4, 1)). 

2 . Let A be a set and let S be any subset of A . Let Xs'- A -> {0 , 1 ) be defined by 
1 ifxeS 



XS{X) - 10 if xtS 
The function Xs> is called the characteristic function of S, 

(a) If A = {a, b, c) and 5 = {a, b\, list the element of Xs ■ 

(b) If A = {a, b, c, d, e) and S = {a, c, e), , list the element of Xs- 

(c) If A = {a, b, c), what are X0 an d Xa- 

3. Find the ranges of each of the relations that are functions in Exercise 1 . 

4. Find the ranges of the following functions on Z: 

(a) g = {(x, Ax + 1) \x e Z) 

(b) h(x) = least integer that is greater than or equal to V \x\ . 

(c) P(x) = x + 10. 

B Exercise 

5 . If | A | and | B \ are both finite, how many different functions are there from A into B? 

6. Let/be a function with domain A and codomain B. Consider the relation K c A x A defined on the domain of /by (x, y) e ^f if and only 
if / (x) = f (y). The relation A^ is called the kernel off. 

(a) Prove that K is an equivalence relation. 

(b) For the specific case of A = Z , where Z is the set of integers , let / : Z -» Z be defined by / (x) = x 1 . Describe the equivalence classes of 
the kernel for this specific function. 
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I 7.2 Injective, Surjective, and Bijective Functions 

Consider the following functions; 

Example 7.2.1. LetA = {1, 2, 3, 4)andB = {a, b, c, d], and defined/ : A -> Bby 

/(l) = a, /(2) = b, /(3) = cand/(4) = d. 
Example 7.2.2. LetA = {1, 2, 3, 4)andB = {a, b, c, rf),and defined^ : A -* Bby 

g(l) =a,g(2) = b, g(3) = a andg(4) = b. 

The function in the first example gives us more information about the set B than the second function. Since A clearly has four elements /tells us 
that the set B contains at least four elements since each element of the set A is mapped onto one and only one element of the set B, The 
properties that / has and g does not have are the most basic properties that we look for in a function. The following definitions summarize the 
basic vocabulary for function properties . 

Definition Injective Function: A function/: A -> B is injective if 

a, be A, a±b => f(a) t f(b). 

Notice that the condition for a injective function is equivalent to 

a, b e A, f (a) = f{b) =* a = b 

Injective functions are also called injections, or one-to-one functions . 

Definition Surjective Function A function f: A -> B is surjective if its range, f(A), is equal to its codomian B 

Notice that the condition for a surjective function is equivalent to 

For all b e B, there exists ae.A such that f(a) = b. 

Surjective functions are also called surjections, or onto functions . 

Definition Bijective Function: A function f: A -> B is bijective if it is both injective and surjective. 

Bijective functions are also called one-to-one, onto functions . 

Example 7.2.3. The function /of Example 7.2.1 is bijective. The function g of Example 7.2.2 is neither injective nor surjective. 

Example 7.2.4. Let A = {1, 2, 3) and B = {a, b, c, d\, and define /: A -> Bby/(1) = b,f(2) = c, and/(3) = a. Then /is injective but 
not surjective. 

Example 7.2.5. The characteristic function, Xs m Exercise 2 of Section 7.1, is surjective if 5 is a proper subset of A, but never injective 
if|A| > 2. 

Example 7.2.6. Let A be the set of students who are sitting in a classroom, and let B be the set of seats in the classroom, and let s be the 
function which maps each student into the chair he or she is sitting in. When is s one to one? When is it onto? Under normal circumstances, s 
would always be one to one since no two different students would be in the same seat. In order for s to be onto, we need all seats to be used, so 
j is onto if the classroom is filled to capacity. 

Functions can also be used for counting the elements in large finite sets or in infinite sets. Let's say we wished to count the occupants in an 
auditorium containing 1,500 seats. If each seat is occupied, the answer is obvious, namely 1,500 people. What we have done is to set up a one- 
to-one correspondence, or bijection, from seats to people. We formalize in a definition. 

Definition: Cardinality. Two sets are said to have the same cardinality if there exists a bijection between them. 

The function/ in Example 7.2.1 serves to show that the two sets A = {1, 2, 3, 4) and B = {a, b, c, d) have the same cardinality. Notice in 
applying the definition of cardinality, we don't actually appear to count either set, we just match up the elements. However, matching the letters 
in B with the number 1 , 2, 3, an 4. is precisely how we count the letters. 

Definition: Countable Set. If a set is finite or has the same cardinality as the set of positive integers, it is called a countable set. 
If a set is finite, and its cardinality is n, then it has the same cardinality as the set { 1 , 2, 3, ... , n). 

Example 7.2.7. The alphabet {A, B, C, . . . , Z) has cardinality 26 through the following bijection into the set { 1 , 2, 3, ... , 26} 
A B C ■■■ Z 

v y y -is 

1 2 3 ••• 26 
Reminder: 2P = (ieP | b= 2 k for some k e P) 

Example 7.2.8. The set 2 P of even positive integers has the same cardinality as the set IP of positive integers. To prove this, we must 
find a bijection from IP to 2 P. Such a function isn't unique, but this one is the simplest: / : P -» 2 P by f(m) = 2m. Two statements must 
be proven to justify our claim that/is a bijection. 
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(1) /is one-to-one. 

Proof: Let a, b <= P and assume that /(a) = /(fo). We must prove that a = b. 
/(a) = /(/>) ^ 2a = 2fe ^ a = l>. ■ 

(2) /is onto. 

Proof: Let & e 2 P. We want to show that there exists an element a e P such that f(a) = b. If b e 2P, fo = 2 k for some £ e P by the 
definition of 2 P. So we have f(k) = 2k = b. Hence, each element of 2P is the image of some element of P. ■ 

An even easier way to look at any function with P as its domain by creating a list of the form /(l), /(2), /(3), . . . , which is 2,4,6,... for 
the function we've just defined. This infinite list clearly has no duplicate entries and every even positive integer appears in the list eventually. 

Bijections with domain IP: A function f :P -* A is a bijection if the infinite list f(l), f(2), f(3), ... contains no duplicates, and every 
element of A appears on in the list. 

Readers who have studied real analysis should realize that the set of rational numbers is a countable set, while the set of real numbers is not a 
countable set. See the exercises at the end of this section for an example of such a set. 

We close this section with an theorem called the Pigeonhole Principle, which has numerous applications even though it is an obvious, common- 
sense statement. Never underestimate the importance of simple ideas. The Pigeonhole Principle states that if there are more pigeons than 
pigeonholes, then two or more pigeons must share the same pigeonhole. A more rigorous mathematical statement of the principle follows. 

The Pigeonhole Principle. Let f be a function from a finite set X onto a finite set Y. If n > 1 and \ X | > n \ Y \, then there exists an 
element of Y that is the image of at least n + 1 elements ofX. 

Example 7.2.9. Assume that a room contains four students with the first names John, James, and Mary. Prove that two students have the 
same first name. We can visualize a mapping from the set of students to the set of first names; each student has a first name. The pigeonhole 
principle applies with n = 1 , and we can conclude that two of the students have the same first name. 

EXERCISES FOR SECTION 7.2 
A Exercises 

1 . Determine which of the functions in Exercise 1 of Section 7.1 are one- to-one and which are onto. 

2. (a) Determine all bijections from the { 1 , 2, 3) into {a, b, c). 
(b) Determine all bijections from { 1 , 2, 3) into {a, b, c, d\. 

3. Which of the following are one-to-one, onto, or both? 

(a) fa : R -> R defined by fa(x) = x 3 - x. 

(b) fa : Z -> Z defined by fa (x) = -x + 2. 

(c) f 3 : N xN -> N defined by f 3 (j, k) = 2' 3*. 

(d) fa : P -> P defined by fa (n) = r n/2"|, where \x\ is the ceiling of x, the smallest integer greater than or equal to x. 

(e) fa : N -» N defined by ./ (n) = n 2 + n. 

(f) fa : N -» N x N defined by fa (n) = {In, 2n+ 1). 

4. Which of the following are injections, surjections, or bijections on R, the set of real numbers? 

(a) f{x) = -2x. 

(b) g(x) = x 2 - 1. 

i x x < 

(c) h(x) = { 2 

y x 1 x>o 

(d) q{x) = 2' 

(e) r(x) = x 3 

(f) s(x) = x 3 - x. 

5. Suppose that m pairs of socks are mixed up in your sock drawer. Use the Pigeonhole Principle to explain why, if you pick m + 1 socks at 
random, at least two will make up a matching pair, 

6. In your own words explain the statement "The sets of integers and even integers have the same cardinality." 

7. Let A = (1, 2, 3, 4, 5). Find functions, if they exist that have the properties specified below 

(a) A function that is one-to-one and onto. 

(b) A function that is neither one-to-one nor onto 
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(c) A function that is one-to-one but not onto. 

(d) A function that is onto but not one-to-one. 

8. (a) Define functions, if they exist, on the positive integers, IP, with the same properties as in Exercise 7 (if possible). 

(b) Let A and B be finite sets where \A\ = \B \. Is it possible to define a function / : A -» B that is one-to-one but not onto? Is it possible 
to find a function g : A -> B that is onto but not one-to-one? 

B Exercises 

9. (a) Prove that the set of natural numbers is countable. 

(b) Prove that the set of integers is countable. 

(c) Prove that the set of rational numbers is countable. 

10. (a) Prove that the set of finite strings of 0's and l's is countable, 
(b) Prove that the set of odd integers is countable. 

(c) Prove that the set f*l x N is countable. 

1 1 . Use the Pigeonhole Principle to prove that an injection cannot exist between a finite set A and a finite set B if the cardinality of A is greater 
than the cardinality of B. 

12. The important properties of relations are not generally of interest for functions. Most functions are not reflexive, symmetric, antisymmet- 
ric, or transitive. Can you give examples of functions that do have these properties? 

13. Prove that the set of all infinite sequences of 0's and l's is not a countable set (i. e., that it is an uncountable set). 

14. Prove that the set of all functions on the integers is an uncountable set. 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 United States 



130 -,, . -. _. .. 130 

Chapter 7 - Functions 



I 7.3 Composition, Identity, and Inverse 



Now that we have a good understanding of what a function is, our next step is to consider an important operation on functions. Our purpose is 
not to develop the algebra of functions as completely as we did for the algebras of logic, matrices, and sets, but the reader should be aware of 
the similarities between the algebra of functions and that of matrices . We first define equality of functions . 

Definition: Equality of Functions. Let f, g : A -» B; that is, let f and g both be functions from A into B. Then f is equal to g (i. e., 
f = g) if and only if f (x) = g(x)forallx e A. 

Two functions that have different domains cannot be equal. For example, / : Z -» Z defined by f(x) = x 2 and g : R -> IR defined by g(x) = x 2 
are not equal even though the formula that defines them is the same. 

On the other hand, it is not uncommon for two functions to be equal even though they are defined differently. For example consider the 
functions h and k, where h : (-1,0, l,2}->{0, 1,2) is defined by h(x)= \x\ and k : (-1,0, l,2}->{0, 1,2) is defined by 

k(x) = — — \- x 2 + - appear to be very different functions . However, they are equal because h(x) = k(x) for x = -1, 0, 1, and2. 

COMPOSITION 

One of the most important operations on functions is that of composition. 

Definition: Composition of Functions. Let f : A -» B and g : B -> C. Then the composition off followed by g, written g °f is a function 
from A into C defined by (g °f) (x) = g(f (x)), which is read " g off of x." 

The reader should note that it is traditional to write the composition of functions from right to left. Thus, in the above definition, the first 
function performed in computing g °f, which is/. On the other hand, for relations, the composition r s is read from left to right, so that the first 
relation is r. 

Example 7.3.1. 

(a) Let /: (1, 2, 3) -> (a, b) be defined by /(l) = a, f (2) = a, and /(3) = b. Let g : {a, b) -> (5, 6, 7) be defined by g (a) = 5 and 
g(b) = 1. Then gof : {1, 2, 3} -» (5, 6, 7} is defined by (g°/)(l) = 5, (g°/)(2)=5, and (g°/)(3)=7. For example, 
(g°/)(l) = 8 (/(')) = g(a) = 5. Note that fog is not defined. Why? 

(b) Let / : R -» R be defined by / (x) = x 3 and let g : R -» R be defined by g (x) = 3 x + 1 . Then, since 

(g°f)Qc) «*(/(*)) = £(* 3 ) = 3x 3 + 1, 

wehaveg°/: R -> R is defined by (go/) (x) = 3 x 3 + 1. Here f°g is also defined and f°g : R -» R is defined by (f°g) (x) =(3x + l) 3 . 
Moreover, since 3 x 3 + 1 + (3 x + l) 3 for at least one real number, g°f + f°g- Therefore, the commutative law is not true for functions 
under the operation of composition. However, the associative law is true for functions under the operation of composition. 

Theorem 73.1. Function composition is associative. That is, iff : A -» B, g : B ->C, and h : C -» D, then h °(g of) = (h og) of. 

Proof Technique: In order to prove that two functions are equal, we must use the definition of equality of functions. Assuming that the 
functions have the same domain, they are equal if, for each domain element, the images of that element under the two functions are equal. 

Proof; We wish to prove that (h°(gof)) (x) = ((hog)°f) (x) for all x e A, which is the domain of both functions. 

(h°(g°f))(x) = h((gof)(x)) by the definition of composition of h with g ° f 
= h(g(f(x))) by the definition of composition of g with / 

Similarly, 

((hog)of)(x) = (hog)(f(x)) by the definition of composition of h °g with/ 
= h(g(f(x))) by the definition of composition of h withg 

Notice that no matter how the functions the expression ho gof is grouped, the final image of any element of x e A is h(g(f(x))) and so 
h°(g°f) = (hog)of. m 

If /is a function from a set A onto itself, we can find f° /, f° f° /, ... ,which we write as f 2 , f 3 , .... This idea can be expressed more 
elegantly as follows; If / : A -> A, the repeated composition of/ with itself is defined recursively as; 

Definition: Powers of Functions. Let f : A -> A. 

(1) /' = /; that is, f\a) = f(a),for a e A. 

(2) Forn>l,f n+l =/"/"; that is, f n+l (a) = f(f n (a)) and a eA. 

Two useful theorems concerning composition are given below. The proofs are left for the exercises. 

Theorem 7.3.2. If f : A -» B and g : B -» C are injections, then gof .- A -» C is an injection. 

Theorem 7.33. Iff : A -» B and g ; B -> C are surjections, then gof;A^>C is a surjection. 

We would now like to define the concepts of identity and inverse for functions under composition. The motivation and descriptions of the 
definitions of these terms come from the definitions of the terms in the set of real numbers and for matrices. For real numbers, the numbers 
and 1 play the unique role that x + = + x = x and x ■ 1 = 1 ■ x = x for any real number x. and 1 are the identity elements for the reals 
under the operations of addition and multiplication, respectively. Similarly, the n x n zero matrix and the n x n identity matrix / are such that 
for any n x n matrix A, A + = + A = A and A I = I A = I. Hence, an elegant way of defining the identity function under the operation of 
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composition would be to imitate the above well-known facts. 
IDENTITY FUNCTION 

Definition; Identity Function. For any set A, the identity function on A is a function from A onto A, denoted by i (or, more specifically, 
i A ) such that i(a) = a for all a e A 

Based on the definition of i, we can show that for all functions /: A -» A, /oj' = j'o/ = f. 

An alternate description is that the identity function on A is the function j*(a) = a for all a e A. This can be proven from the above definition. 

Example 7.3.2. If A = {1, 2, 3), then the identity function/ : A -> A is defined by i'(l) = 1, i(2) = 2, and i (3) = 3. 

Example 7.3.3. The identity function on R is i : R -» R defined by i (x) = x. 

INVERSE FUNCTIONS 

We will introduce the inverse of a function with a special case: the inverse of a function on a set. After you've taken the time to understand this 
concept, you can read about the inverse of a function from one set into another. The reader is encouraged to reread the definition of the inverse 
of a matrix in Section 5.4 to see that the following definition of the inverse function is a direct analogue of that definition. 

Definition: Inverse Function. Let f : A -» A. If there exists a function g : A -» A such that g°f = f°g = i, then g is called the 
inverse of f and is denoted by f' 1 , read "f inverse." 

Notice that in the definition we refer to "the inverse" as opposed to "an inverse." It can be proven that a function can never have more than one 
inverse (see exercises). 

An alternate description of the inverse of a function, which can be proven from the definition, is as follows: 

Let / : A -» A be such that / (a) = b. Then when it exists, f~ l is a function from A to A such that f~ [ (b) = a. Note that / _1 "undoes" what/ 
does. 

Example 7.3.4. Let A = {1, 2, 3) and let / be the function defined on A such that /(l) = 2, /(2) = 3, and /(3) = 1. Then 
/- 1 : A -> A is defined by f~\l) = 3,/" 1 (2) = l,and/-'(3) = 2. 

Example 7.3.5. If g : R -» R is defined by g(x) = x 3 , then g' 1 is the function that undoes what g does. Since g cubes real numbers, g~ l 

must be the "reverse" process, namely, takes cube roots. Therefore, g' 1 : R -» R is defined by g~ l (x) = y x . We should show that 
g' 1 og = i and g°g~ [ = i. We will do the first, and the reader is encouraged to do the second. 

(g' 1 °g)(x) = g' 1 (g(x)) Definition of composition 



= rV) 


Definition of g 


= {J 


Definition of g~ l 


= X 


Definition of cube root 


= i(x) 


Definition of the identity function 



Therefore, g °g = i. Why? 

The definition of the inverse of a function alludes to the fact that not all functions have inverses . How do we determine when the inverse of a 
function exists? 

Theorem 7.3.4. Let f : A -» A. / _; exists if and only if fis a bijection; i. e.fis one-to-one and onto. 

Proof: (=>) In this half of the proof, assume that / _1 exists and we must prove that/is one-to-one and onto. To do so, it is convenient 
for us to use the relation notation, where f(s) = t is equivalent to (s, t) e /. To prove that / is one-to-one, assume that f(a) = f(b) = c. 
Alternatively, that means (a, c) and (b, c) are elements of / . We must show that a = b. Since (a, b), (c, b) e /, (c, a) and (c, b) are in 
/~' . By the fact that /"' is a function and c cannot have two images, a and b must be equal, so/is one-to-one. 

Next, to prove that/is onto, observe that for / _1 to be a function, it must use all of its domain, namely A. Let b be any element of A. 
Then b has an image under / _1 , f (b). Another way of writing this is (b, f (b)) e / -I , By the definition of the inverse, this is equivalent to 
(f~ l (b), b) <= f. Hence, b is in the range of/. Since b was chosen arbitrarily, this shows that the range of/must be all of A. 

(«= ) Assume/is one-to-one and onto and we are to prove / _1 exists. We leave this half of the proof to the reader. ■ 

Definition; Permutation. A bijection of a set A into itself is called a permutation of A, 

Next, we will consider the situation where f: A->B and B is not necessarily equal to A. How do we define the inverse in this case? 

Definition: Inverse of a Function (General Case). Let f : A -» B, If there exists a function g : B -> A such that g °f = i A and 
f ° 8 = 'b • then g is called the inverse of f and is denoted by f' 1 , read "f inverse." 

Note the slightly more complicated condition for the inverse in this case because the domains of /° g and g °f are different if A and B are 
different. The proof of the following theorem isn't really very different from the special case where A = B. 

Theorem 73.5. Let f : A -» B. f' 1 exists if and only if fis a bijection. 

Example 7.3.6. Let A ={1,2, 3) and B = (a, b, c). Define /: A -> B by /(l) = a, /(2) = b, and /(3) = c. Then g : B -* A 
defined by g (a) = 1 , g(b) = 2, and g (c) = 3 is the inverse of/ 
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Cg»/)(1)=1 (fog){a) = a 

(gof) (2) = 2}=> go/ = i A and (fog)Q,) = b}=> fog = i B 

(g°/)(3) = 3 (/°g)(c) = c 

jgfr Mathematics Note 

Computer languages have many functions that also have inverses. Here are a few examples in Mathematica: 
RotateRight and RotateLef t are both functions on lists. For example, 

RotateRight [{1, 2, 3, 4, 5}] 

(5, 1,2, 3,4} 

RotateLeft[{l, 2, 3, 4, 5}] 

{2, 3,4,5, 1} 

RotateLef t [RotateRight [{1, 2, 3, 4, 5}]] 

(1,2,3,4,5) 

IntegerDigits and FromDigits are inverses of one another, with domains being the positive integers and lists of digits, respectively. 
For example 

IntegerDigits [1492] 

(1.4,9,2) 

FromDigits [{1, 4, 9, 2}] 

1492 

FromDigits [IntegerDigits [193 410] ] 

193410 

IntegerDigits [FromDigits [{4, 3, 1, 7, 8}]] 

(4, 3, 1,7,8) 

EXERCISES FOR SECTION 7.3 
A Exercises 

1. Let A = (1, 2, 3, 4, 5), B = (a, b, c, d, e, /), and C = (+, -). Define /: A ->- B by f(k) equal to the k th letter in the alphabet, and 
define g : B -» C by g(a) = + if a is a vowel and g(a) = - if a is a consonant. 

(a) Find gof. 

(b) Does it make sense to discuss f°gl If not, why not? 

(c) Does/" 1 exist? Why? 

(d) Does g-' exist? Why? 

2. Let A = (1, 2, 3). Define/: A ^ A by /(l) = 2, /(2) = l,and /(3) = 3. Find/ 2 ,/ 3 ,/ 4 and/-'. 

3. Let A = (1, 2, 3). 

(a) List all permutations of A. 

(b) Find the inverse of each of the permutations of part a. 

(c) Find the square of each of the permutations of part a. 

(d) Show that the composition of any two permutations of A is a permutation of A. 

(e) Prove that if A be any set where the \A\ =n, then the number of permutations of A is n!. 

4. Define s, u, and d, all functions on the integers, by s(n) = n 2 ,u(n) = n + 1 , and d (n) = n — 1, Determine: 
(a) u o s ° d 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3 .0 United States 



Chapter 7 - Functions 



(b) s ° u° d 

(c) d ° s ° u 

5. Based on the definition of the identity function, show that for all functions / : A -> A, f°i = i°f = f. 

6. Inverse images. If /is any function from A into B, we can describe the inverse image of from B into f(A), which is also commonly denoted 
f~ l .Mb e B, f~ l (b) = {a eA | f (a) = fo). If /does have an inverse, the inverse image of b is {f~ [ (b)\. 

(a) Letg : R ^ R be defined by g(x) = x 2 . What areg- 1 ^),^ 1 ^) and^'(-l)? 

(b) Ifr: R^Z, where r(x) = M, what is r _1 (l)? 

7. Let/, g, and ft all be functions from Z into Z defined by /(n) = n + 5,g(rc) = n - 2, and h(n) = n 2 . Define: 

(a) f°g 

(b) / 3 

(C) /oft 

8. Define the following functions on the integers by / (k) = k + l,g(k) = 2 k, and ft(fc) = \kj2~\ 

(a) Which of these functions are one-to-one? 

(b) Which of these functions are onto? 

(c) Express in simplest terms the compositions f°g, g °f, g ° ft, ft ° g, and ft 2 , 

B Exercises 

9. State and prove a theorem on inverse functions analogous to Theorem 5.4.1 (if a matrix has an inverse, that inverse is unique). 

10. Let/and g be functions whose inverses exist. Prove that (f°gY l = g~' °/~' . (Hint: See Exercise 3 of Section 5.4.) 

11. Prove Theorems 7.3.2 and 7.3.3. 

1 2 . Prove the second half of Theorem 7 .3 .4 . 

13. Prove by induction that if n> 2 and /, f 2 , ... , /„ are invertible functions on some nonempty set A, then 
(/i°/ 2 ° ■■■ ° /„)"' = /„"' °--'°.Z T 1 °/j _1 . The basis has been taken care of in Exercise 10. 

C Exercises 

14. (a) Our definition of cardinality states that two sets, A and B, have the same cardinality if there exists a bijection between the two sets. 
Why does it not matter whether the bijection is from A into B or B into A? (b) Prove that "has the same cardinality as" is an equivalence relation 
on sets. 

15 . Construct a table listing as many "Laws of Function Composition" as you can identify. Use previous lists of laws as a guide. 
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SUPPLEMENTARY EXERCISES FOR CHAPTER 7 

Section 7.1 

1. If/ : Z -> Z is defined by /(a) = 2 |a| + 1, 

(a) What is the domain of/? 

(b) What is the codomain of/? 

(c) What is the image of -5 under/? 

(d) What is the range of/? 

(e) Given that /(a) = 11, can you tell exactly what a must be? 

2. Let / : P -» P, where/(a) is the largest power of two that evenly divides a; for example, /(12) = 4, /(9) = 1, and/(8) = 8. Describe the 
equivalence classes of the kernel of/ 

3. Are any of the relations given in Figure 6.3.2 functions on the set A of nodes? Explain. 

4. Let U be a set with subsets A and B. 

(a) Showthatg: U -> {0, 1) defined by g(a) = min(C/i(a), Cs(a)) is the characteristic function of A f] B. (See Exercise 2 of Section 7.1.) 

(b) What characteristic function is h : U -» {0, 1) defined by ft(a) = max(C/i(a), Cgia))^ 

(c) How are the characteristic functions of A and A c related? 

Section 7.2 

5. Recall that every function is a relation. Suppose that /is a function on {1, 2, ..., n), where « is a positive integer. 

(a) What must the matrix of /look like? In other words, what distinguishes the matrix of/ from the matrix of the typical relation? How many 
1 s appear in the matrix? 

(b) If /is a bijection, what further property does the matrix of /have? (Hint for those who know chess: We will call the matrix of a bijection a 
"rook matrix.") 

6. UtS = {a, b, c}andletr={l,2, 3, 4} 

(a) Give an example of a relation from S to T that is not a function. 

(b) Give an example of a relation from S to T that is an injection. 

(c) How many injections are there from S to 77 

(d) How many surjections are there from 5 to 77 

7. Prove that the following sets are countable. 

(a) |n 2 :neN| 

(b) {-:neP} 

(c) (3,9,27, 81, ...)U(2, 4, 8, 16, ...) 

8. Prove that if A and B are countable sets, then their union is also countable. 

9. Prove that if A and B are any two sets, then |(AxB)| = |(BxA)|; that is, prove that AxB andBxA have the same cardinality. 

10. Let A be a finite set. 

(a) Is every injection / : A -» A a surjection? Explain. 

(b) Is every surjection / : A -» A an injection? Explain. 

(c) Are parts a and b true if A is an infinite set? Give a counterexample. 

1 1 . Two children playing "spy" have devised a code that consists of spelling each word backwards and replacing the letters a and t by m and q, 
respectively. Let A be the set of letters in the English alphabet. Assume the boys use characters only from A. Explain this code using the 
concept of functions. Explain why this code will or will not work, using the concept of functions. 

Section 7.3 
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12. Consider the functions /, g : IR -> R defined by f(x) = 8 x + 5 and g(x) = x 2 . 

(a) Show that/is injective. 

(b) Show that/is surjective. 

(c) Fmdf-'(x). 

(d) Findg °/(x). 

(e) Find / °g(x). 

13. If /, g, ft : N -» N , where /(a) = 10a, g(a) = a + 10, and ft(a) = adiv \0 {the quotient in a ± 10), calculate 

(a) (fog) (a). 

(b)g 2 (a) = (gog)(a). 
(c) (Ac/) ( fl ). 

(d)(ho g )(a). 

14. If/, w, rf : N -» N, where/(a) = 2a, w(a) = a + 1, and rf(a) = max(0, a - 1), calculate 

(a) (/o«)(o). 

(b) / 2 (a). 

(c) (do (/o«)) (a). 

(d) (rfow) (a) = a; therefore, what is t/° m? 

(e) Explain why d is not the inverse of w. 

15. Let/be a function on A = {a, b, c, d\ such that f(a) = c, and f(d) = b. 

(a) What are /(*) and /(c) if f°f = /? 

(b) What are /(£) and /(c) if f°f = il 

16. Let A be a nonempty set. Prove that if/ is a bijection on A and /»/ = /, then/is the identity function, ;'. Hint: You have seen a similar 
proof in matrix algebra. 

I a b \ 

17. For the real matrix A = , , detA-ad-bc. 

\c d ] 

A bijection from a set to itself is, also referred to as a. permutation of the set. Let n be a permutation of [a, b, c, d) such that a becomes Ji(d), b 
becomes n(b), etc. 

1 7i(a) 7t(b) \ . ...... 

Let B = ,. , , % . How many permutations of n leave the determinant of A invariant, that is, det A = detB? 

V 7r(c) n(d) ] 

18. (For those who have had calculus): Let V be the set of all functions that are defined and have infinitely many derivatives over some fixed 
interval of the real line. Let /eV and define D : V -> Vby D(f) = /', that is, D is the act of taking the derivative. Hence: 
D(x 2 + 3x + 5) = 2x + 3. 

(a) Is D a function? Explain. 

(b) Is D a bijection? Explain. 

(c) What is the interpretation ofD 2 = D°D, D 3 , ...,£>"? 

(d) Does D~ [ exist? If not, is there a function in calculus that is "close to" IT 1 ? What is it? 

19. The exponential function / : R -» R + defined by /(x) = ft- 1 has as its inverse the logarithmic function g(x) = log 6 x. 

(a) What are the domain and codomain of gl 

(b) Prove that g is the inverse of/. 

20. In high school, many of us saw the notation sin -1 x. Why is this notation used? 
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RECURSION AND RECURRENCE RELATIONS 



GOALS 

An essential tool that anyone interested in computer science must master is how to think recursively. The ability to understand definitions, 
concepts, algorithms, etc., that are presented recursively and the ability to put thoughts into a recursive framework are essential in computer 
science. One of our goals in this chapter is to help the reader become more comfortable with recursion in its commonly encountered forms. 

A second goal is to discuss recurrence relations. We will concentrate on methods of solving recurrence relations, including an introduction to 
generating functions . 



I 8.1 The Many Faces of Recursion 



Consider the following definitions, all of which should be somewhat familiar to you. When reading them, concentrate on how they are similar. 

Example 8.1.1. A very common alternate notation for the binomial coefficient is C (n; k). We will use the latter notation in this chapter. 

Here is a recursive definition of binomial coefficients. 



Definition: Binomial Coefficients. Assume n >0 and n > k > 0. 
C(n;0) = 1 

C(n, n) = 1 
and C(n; k) = C(n - 1; k) + C(n - 1; k - 1) ifn>k> 0. 



POLYNOMIALS AND THEIR EVALUATION 

Definition: Polynomial Expression in x over S (Non-Recursive). Let n be an integer, n >0. An n' h degree polynomial in x is an 

expression of the form a„x" + a n _ 1 x n ~ > + ■■■ + ajX + a , where a„, a n _ l , ...,a h a are elements of some designated set of numbers, S, 
called the set of coefficients and a„ t 0. 

We refer to x as a variable here, although the more precise term for x is an indeterminate. There is a distinction between the terms 
indeterminate and variable, but that distinction will not come into play in our discussions. 

Zeroth degree polynomials are called constant polynomials and are simply elements of the set of coefficients . 

This definition is often introduced in algebra courses to describe expressions such as f (ri) = 4-n 3 + 2n 2 - %n + 9, a third-degree, or cubic, 
polynomial in n. This definitions has has drawbacks when the variable is given a value and the expression must be evaluated. For example, 
suppose that n = 7. Your first impulse is likely to do this: 

/(7) = 4 - 7 3 + 2 v 7 2 - 8 x 7 + 9 

= 4x343+2x 49 -8x7 + 9= 1423 
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A count of the number of operations performed shows that five multiplications and three additions/subtractions were performed. The first two 
multiplications compute 7 2 and 7 3 , and the last three mutiply the powers of 7 times the coefficients. This gives you the four terms; and adding/- 
subtacting a list of k numbers requires k — 1 addition/subtractions. The following definition of a polynomial expression suggests another more 
efficient method of evaluation. 

Definition: Polynomial Expression in x over S (Recursive). Let S be a set of coefficients and x any variable. 

(a) A zeroth degree polynomial expression in x over S is a nonzero element of S. 

(b) For n > 1 , an n' h degree polynomial expression in x over S is an expression of the form p(x) x + a where p (x) is an (n - 1)"' degree 
polynomial expression in x and a e S. 

We can easily verify that f(n) is a third-degree polynomial expression in n over the Z based on this definition: 

f(n)= (An 1 + 2n-8)n + 9= ((An + 2)n - 8)n + 9 

Notice that 4 is a zeroth degree polynomial since it is an integer. Therefore An + 2 is a first-degree polynomial; therefore, (4 n + 2) n - 8 is a 
second-degree polynomial in n over Z; therefore, f (n) is a third-degree polynomial in n over Z. The final expression for f(n) is called its 
telescoping form. If we use it to calculate /(7), we need only three multiplications and three additions /subtractions. This is called Horner's 
method for evaluating a polynomial expression. 

Example 8.12. (a) The telescoping form of p (x) = 5 x 4 + 12 x 3 -6x 2 + x + 6 is (((5x + 12) x - 6) x + \)x + 6. Using Horner's 
method, computing the value of p(c) requires four multiplications and four additions/subtractions for any real number c. 

(b) g(x) = -x 5 + 3x 4 + 2x 2 + x has the telescoping form ((((- x + 3) x) x + 2) x + 1) x. 

Many computer languages represent polynomials as lists of coefficients, usually starting with the constant term. For example, 

g(x) = -x 5 + 3x 4 + 2x 2 + x would be represented with the list {0, 1,2,0,3,-1}. In both Mathematica and Sage, polynomial expressions 
can be entered and manipulated, so the list representation is only internal. Some lower-leveled languages do require users to program polyno- 
mial operations with lists. We will leave these programming issues to another source. 

Example 8.1.3. A recursive algorithm for a binary search of a sorted list of items: r = {/-(l), r(2) ... , r(n)) represent a list of n items 

sorted by a numeric key in descending order. The / h item is denoted r(j) and its key value by r(/).key. For example, each item might 
contain data on the buildings in a city and the key value might be the height of the building. Then r(\) would be the item for the tallest 
building. The algorithm BinarySearch (j, k) can be applied to search for an item in r with key value C. This would be accomplished by the 
execution of BinarySearch(l, n). When the algorithm is completed, the variable Found will have a value of true if an item with the desired 
key value was found, and the value of location will be the index of an item whose key is C. If Found stays false, no such item exists in 
the list. The general idea behind the algorithm is illustrated in Figure 8.1 .2. 

Higiicr Key* l.nwcr Kcya 



>-{/} 
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FIGURE 8.1.2 Illustration of BinarySearch 
In this algorithm, Found and location are "global" variables to execution of the algorithm. 

BinarySearch (J, k) : 
Found = False 
lfJ<K 
Then 

Mid = L(J+k)/2j 
If /-(Mid).key == C 
Then 

location = Mid 
Found = TRUE 
Else 
If r(Mid).key < C 

Then execute BinarySearchfj, Mid - 1) 
Else execute BinarySearch(Mid + 1 ,k) 

For the next two examples, consider a sequence of numbers to be a list of numbers consisting of a zeroth number, first number, second number, 
... . If a sequence is given the name 5, the k th number of 5, is usually written S k or S(k). 
Example 8.1.4 Define the sequence of numbers B by 

S = 100 and 

B k = 1.08 B k _ { fork> 1 

These rules stipulate that each number in the list is 1 .08 times the previous number, with the starting number equal to 100. For example 
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B 3 = 1.08 B 2 

= 1.08(1.08 60 

= 1.08 (1.08 (1.08 B )) 
= 1.08(1.08(1.08 100)) 

= 1.08 3 100 
= 125.971 

Example 8.1.5. The Fibonacci sequence is the sequence F defined by 

F = 1 , F[ = 1 and 

*"* =Fk-i +F k _,for k>2. 

RECURSION 

All of the previous examples were presented recursively. That is, every "object" is described in one of two forms. One form is by a simple 
definition, which is usually called the basis for the recursion. The second form is by a recursive description in which objects are described in 
terms of themselves, with the following qualification. What is essential for a proper use of recursion is that the objects can be expressed in 
terms of simpler objects, where "simpler" means closer to the basis of the recursion. To avoid what might be considered a circular definition, 
the basis must be reached after a finite number of applications of the recursion. 

To determine, for example, the fourth item in the Fibonacci sequence we repeatedly apply the recursive rule for F until we are left with an 
expression involving Fq and Fj : 

F 4 = F 2 + F 3 

= (F„ + F,) + (Fj + F 2 ) 

= (F +F 1 ) + (F 1 +(F +F 1 )) 

= (1 + 1) + (1 + (1 + 1)) 

= 5 

ITERATION 

On the other hand, we could compute a term in the Fibonacci sequence, say F 5 by starting with the basis terms and working forward as follows: 

F 2 = F + Fj = 1 + 1 = 2 
F 3 = F] + F 2 = 1 + 2 = 3 
F 4 = F 2 + F 3 = 2 + 3 = 5 
F 5 =F 3 +F 4 = 3 + 5 = 8 

This is called an iterative computation of the Fibonacci sequence. Here we start with the basis and work our way forward to a less simple 
number, such as. Try to compute F 5 using the recursive definition for F as we did for F4 . It will take much more time than it would have taken 
to do the computations above. Iterative computations usually tend to be faster than computations that apply recursion. Therefore, one useful 
skill is being able to convert a recursive formula into a nonrecursive formula, such as one that requires only iteration or a faster method, if 
possible. 

An iterative formula for C (n; k) is also much more efficient than an application of the recursive definition. The recursive definition is not 
without its merits, however. First, the recursive equation is often useful in manipulating algebraic expressions involving binomial coefficients. 
Second, it gives us an insight into the combinatoric interpretation of C(n; k). In choosing k elements from {1, 2, ..., n), there are 
C(n - 1; k) ways of choosing all k from {1,2, . . . , n - 1), and there are C(n - \;k- \) ways of choosing the k elements if n is to be 
selected and the remaining k - \ elements come from {1, 2, ..., n - 1). Note how we used the Law of Addition from Chapter 2 in our 
reasoning. 

BinarySearch Revisited. In the binary search algorithm, the place where recursion is used is easy to pick out. When an item is examined 
and the key is not the one you want, the search is cut down to a sublist of no more than half the number of items that you were searching in 
before. Obviously, this is a simpler search. The basis is hidden in the algorithm. The two cases that complete the search can be thought of as 
the basis. Either you find an item that you want, or the sublist that you have been left to search in is empty (j > k). 

BinarySearch can be translated without much difficulty into any language that allows recursive calls to its subprograms. The advantage to such 
a program is that its coding would be much shorter than a nonrecursive program that does a binary search. However, in most cases the recursive 
version will be slower and require more memory at execution time. 

INDUCTION AND RECURSION 

The definition of the positive integers in terms of Peano's Postulates (Section 3.7) is a recursive definition. The basis element is the number 1 
and the recursion is that if n is a positive integer, then so is its successor. In this case, n is the simple object and the recursion is of a forward 
type. Of course, the validity of an induction proof is based on our acceptance of this definition. Therefore, the appearance of induction proofs 
when recursion is used is no coincidence. 

Example 8.1.6. A formula for the sequence/? in Example 8.1.4 is B = 100(1.08)* for k > 0. A proof by induction follows: If k = 0, then 
B = 100 ( 1 .08)° = 100, as defined. Now assume that for some k > 1 , the formula for B k is true. 
B k+1 = 1 .08 B k by the recursive definition 

= 1 .08 ( 100 ( 1 .08)*) by the induction hypothesis 
= 100 ( 1 .08)* + ' hence the formula is true for k + 1 
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The formula that we have just proven for B is called a closed form expression. It involves no recursion or summation signs. 

Definition: Closed Form Expression. Let E = E(x h x 2 , .... x„) he an algebraic expression involving variables x h x 2 , .... x n which are 
allowed to take on values from some predetermined set. E is a closed form expression if there exists a number B such that the evaluation of 
E with any allowed values of the variables will take no more than B operations (alternatively, B time units). 

n 

Example 8.1.7. The sum Ein) = £ k is not a closed form expression because the number of additions needed evaluate E(n) grows 

k=i 

indefinitely with n. A closed form expression that computes the value of E(n) is , which only requires B = 3 operations. 

EXERCISES FOR SECTION 8.1 
A Exercises 

1. By the recursive definition of binomial coefficients, C(5; 2) = C(4; 2) + C(4; 1). Continue expanding C(5; 2) to express it in terms of 
quantities defined by the basis. Check your result by applying the factorial definition of C (n\ k). 

2. Define the sequence L by Lq = 5 and for k > 1, L k = 2 L k _ l - 7. Determine L4 and prove by induction that L k = 7 - 2 k+[ . 

3. Letp(jc) = x 5 + 3x 4 - 15 x 3 + x - 10. 

(a) Write p(x) in telescoping form. 

(b) Use a calculator to compute p (3) using the original form of p(x). 

(c) Use a calculator to compute p (3) using the telescoping form of p(x) . 

(d) Compare your speed in parts b and c . 

B Exercises 

4. Suppose that a list of nine items, (r(l), r(2), . . . , r(9)), is sorted by key in decending order so that r (3). key = 12 and r (4).key = 10. List the 
executions of BinarySearch that would be needed to complete BinarySearch(l,9) for: 

(a) C = 12 

(b) C=ll 

Assume that distinct items have distinct keys. 

5. What is wrong with the following definition of / : IR -> R? 

/(0) = land/(x) = f(x/2)/2if x ±0. 
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| 8.2 Seq 



uences 



Definition: Sequence. A sequence is a function from the natural numbers into some predetermined set. The image of any natural 

number k can be written interchangeably as S (k) or S k and is called the k' h term of S. The variable k is called the index or argument of the 
sequence. 

For example, a sequence of integers would be a function S '. N -> Z . 
Example 8.2.1. 

(a) The sequence A defined by A (k) = k 2 - k, k > 0, is a sequence of integers. 

(b) The sequence B defined recursively by £(0) = 2 and B (k) = B(k - 1) + 3 ioxk > 1 is a sequence of integers. The terms of B can be 
computed either by applying the recursion formula or by iteration. For example; 

B(3) = B(2) + 3 

= (5(1) + 3) + 3 

= ((B(0) + 3) + 3) + 3) 

= ((2 + 3) + 3) + 3 
= 11 

or 

B(l) = B(0) + 3 = 2 + 3 = 5 
B(2) = B(l) + 3 = 5 + 8 = 8 
B(3) = B(2) + 3 = 8 + 3 = 11. 

(c) Let C r be the number of strings of 0's and l's of length r having no consecutive zeros. These terms define a sequence C of integers. 
Remarks; 

(1) A sequence is often called a discrete function. 

(2) Although it is important to keep in mind that a sequence is a function, another useful way of visualizing a sequence is as a list. For 
example, the sequence A could be written as (0, 0, 2, 6, 12, 20, . . .). Finite sequences can appear much the same way when they are the 
input to or output from a computer. The index of a sequence can be thought of as a time variable. Imagine the terms of a sequence flashing on a 

screen every second. The s k would be what you see in the k lh second. It is convenient to use terminology like this in describing sequences. For 

example, the terms that precede the k th term of A would be A (0), A (1), . . . , A(k - 1). They might be called the earlier terms. 

A FUNDAMENTAL PROBLEM 

Given the definition of any sequence, a fundamental problem that we will concern ourselves with is to devise a method for determining any 
specific term in a minimum amount of time. Generally, time can be equated with the number of operations needed. In counting operations, the 
application of a recursive formula would be considered an operation. 

Example 8.2.2. 

(a) The terms of A in Example 8.2.1 are very easy to compute because of the closed form expression. No matter what term you decide to 
compute, only three operations need to be performed. 

(b) How to compute the terms of B is not so clear. Suppose that you wanted to know B (100). One approach would be to apply the definition 
recursively: 

B(100) = B(99) + 3 = (B(98) + 3) + 3 = ... 

The recursion equation for B would be applied 100 times and 100 additions would then follow. To compute B (k) by this method, 2 k operations 
are needed. An iterative computation of B (k) is an improvement: 

B(l) =B(0) +3 = 2 + 3 = 5 
B(2) =B(l) + 3= 5 + 3 = 8 

etc. 

Only k additions are needed. This still isn't a good situation. As k gets large, we take more and more time to compute B (k). The formula 
B(k) = B(k - 1) + 3 is called a recurrence relation on B. The process of finding a closed form expression for B (k), one that requires no more 
than some fixed number of operations, is called solving the recurrence relation. 

(c) The determination of C k is a standard kind of problem in combinatorics. One solution is by way of a recurrence relation. In fact, many 
problems in combinatorics are most easily solved by first searching for a recurrence relation and then solving it. The following observation will 
suggest the recurrence relation that we need to determine C k : If k > 2, then every string of 0's and l's with length k and no two consecutive 0's 
is either ls k _ { or 01 s k _ 2 , where s k _ t and s k _ 2 are strings with no two consecutive 0's of length k — 1 and k - 2 respectively. From this 
observation we can see that C k = C k _ 2 + C k _ { for k > 2. The terms Co = 1 and Ci = 2 are easy to determine by enumeration. Now, by 
iteration, any C k can be easily determined. For example, C 5 = 21 can be computed with five additions. A closed form expression for C k would 
be an improvement. Note that the recurrence relation for C k is identical to the one for the Fibonacci sequence (Example 8.1.4). Only the basis is 
different. 
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EXERCISES FOR SECTION 8.2 
A Exercises 

1 . Prove by induction that B (k) = 3 k + 2, k > 0, is a closed form expression for the sequence B in Example 8.2.1 . 

2. (a) Consider sequence Q defined by Q(k) = 2k + 9, k > 1. Complete the table below and determine a recurrence relation that describes Q. 

k Q(k) Q(k) - Q(k - 1) 

2 
3 

4 
5 
6 

7 

(b) Let A(k) = k 2 - k, k > 0. Complete the table below and determine a recurrence relation for A . Notice that 
(A(k) - A(k - 1) - (A(k - 1) - A(k - 2)) = A(k) - 2 A(k - 1) + A(k - 2) 

k A(k) A(k) -A(k-\) A(k) - 2 A(k - 1) + AQc - 2) 

2 
3 

4 
5 

3. Given k lines (k > 0) on a plane such that no two lines are parallel and no three lines meet at the same point, let P (k) be the number of 
regions into which the lines divide the plane (including the infinite ones (see Figure 8.2.1). Describe geometrically how the recurrence relation 
P(k) = P(k—\) + k can be obtained. Given that P(0) = 1, determine P (5). 




FIGURE 8.2.1 Exercise 3 

4. A sample of a radioactive substance is expected to decay by 0.15 percent each hour. If w,,t > 0, is the weight of the sample t hours into an 
experiment, write a recurrence relation for w. 

B Exercise 

5. Let M (n) be the number of multiplications needed to evaluate an n th degree polynomial. Use the recursive definition of a polynomial 
expression to define M recursively. 
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I 8.3 Recurrence Relations 



In this section we will begin our study of recurrence relations and their solutions. Our primary focus will be on the class of finite order linear 
recurrence relations with constant coefficients (shortened to finite order linear relations). First, we will examine closed form expressions from 
which these relations arise. Second, we will present an algorithm for solving them. In later sections we will consider some other common 
relations (8.4) and introduce two additional tools for studying recurrence relations: generating functions (8.5) and matrix methods (Chapter 12). 

Definition: Recurrence Relation. Let S be a sequence of numbers, A recurrence relation on 5 is a formula that relates all but a finite 
number of terms of S to previous terms of S. That is, there is a k in the domain of S such that if k > k , then S(k) is expressed in terms of 
some (and possibly all) of the terms that precede S(k). If the domain of 5 is {0 , 1,2, ...), the terms 5(0), 5(1), ..., S(k - 1) are not defined 
by the recurrence formula. Their values are the initial conditions (or boundary conditions, or basis) that complete the definition of S. 

Example 8.3.1. 

(a) The Fibonacci sequence is defined by the recurrence relation F k = F k _ 2 + F k _ l , k > 2 , with the initial conditions F = 1 and F, = 1 . The 
recurrence relation is called a second-order relation because F k depends on the two previous terms of F. Recall that the sequence C in Section 
8.2 can be defined with the same recurrence relation, but with different initial conditions. 

(b) The relation T (k) = 2 T(k - l) 2 - k T(k - 3) is a third-order recurrence relation. If values of T (0), 7(1), and 7 (2) are specified, then 7 
is completely defined. 

(c) The recurrence relation S(n) = 5([_«/2J) + 5, n > 0, with 5(0) =0has infinite order. To determine S(n) when n is even, you must go 
back n/2 terms. Since re/2 grows unbounded with n, no finite order can be given to 5. 

SOLVING RECURRENCE RELATIONS 

Sequences are often most easily defined with a recurrence relation; however, the calculation of terms by directly applying a recurrence relation 
can be time consuming. The process of determining a closed form expression for the terms of a sequence from its recurrence relation is called 
solving the relation. There is no single technique or algorithm that can be used to solve all recurrence relations. In fact, some recurrence 
relations cannot be solved. The relation that defines 7 above is one such example. Most of the recurrence relations that you are likely to 
encounter in the future as classified as finite order linear recurrence relations with constant coefficients. This class is the one that we will spend 
most of our time with in this chapter. 

Definition: n' h Order Linear Recurrence Relation. Let S be a sequence of numbers with domain k >0. An n' h order linear recurrence 
relation on 5 with constant coefficients is a recurrence relation that can be written in the form 

S(k) + C,S(k - 1) + ...+C n S(k - n) = f (k) for k >n 
where C j, C 2 , ■ ■■, C n are constants and f is a numeric function that is defined for k >n. 

Note: We will shorten the name of this class of relations to n' h order linear relations. Therefore, in further discussions, S(k) + 2 k S(k — 1) = 
would not be considered a first-order linear relation. 

Example 8.3.2. 

(a) The Fibonacci sequence is defined by the second-order linear relation because F k - F k _ i - F k _ 2 = 

(b) The relation P (j) + 2P(j — 3) = j 2 is a third-order linear relation. In this case, C\ = Ci = 0. 

(c) The relation A(k) = 2(A(k - X) + k) can be written as A (k) -2A(k - 1) = 2k. Therefore, it is a first-order linear relation. 

RECURRENCE RELATIONS OBTAINED FROM "SOLUTIONS" 

Before giving an algorithm for solving finite order linear relations, we will examine recurrence relations that arise from certain closed form 
expressions. The closed form expressions are selected so that we will obtain finite order linear relations from them. This approach may seem a 
bit contrived, but if you were to write down a few simple algebraic expressions, chances are that most of them would be similar to the ones we 
are about to examine. 

Example 8.3.3. 

(a) Consider D, defined by D(k) =5-2* ,k > 0. If k > 1, 

D(k) =5-2* = 2-5 -2 k - 1 = 2D(k - 1). 

Therefore, D satisfies the first order linear relation D (k) - 2 D (k — 1) = and the initial condition D (0) = 5 serves as an initial condition 
foiD. 

(b) If C(k) = 3*~' + 2 k+l + k ,k > 0, quite a bit more algebraic manipulation is required to get our result: 
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C{k) = 3* ' +2 k+[ +k Original equation 

3 C{k - 1) = 3*- 1 + 3 ■ 2 k + 3 (k - 1) Substitute k - 1 for k and multipy by 3 

Subtract the second equation from the first 

C(k) - 3 C(k - 1 ) = -2 k - 2 k + 3 3*~ ' term is eliminated , this is a first order relation 

2 C {k - 1) - 6 C {k - 2) = -2* - 2 (2 {k - 1) + 3) Substitute fc - 1 for fcin the 3 rd equation, mult, by 2 

Subtract the fourth equation from the third equation 

C(k) -5C(k-\)-6C(k-2) = 2k-l 2 k+[ term eliminated, this is a 2 nd order relation 

The recurrence relation that we have just obtained, defined for k > 2, together with the initial conditions C (0) = 7/3 and C (1) = 5, define 
C. We could do more algebra to obtain a third-order linear relation in this case. 

Table 8.3.1 summarizes our results together with a few other examples that we will let the reader derive. Based on these results, we might 
conjecture that any closed form expression for a sequence that combines exponential expressions and polynomial expressions will be solutions 
of finite order linear relations. Not only is this true, but the converse is true: a finite order linear relation defines a closed form expression that is 
similar to the ones that were just examined. The only additional information that is needed is a set of initial conditions. 

Closed Form Expression Recurrence Relation 

D{k) = 5-2* D(k)-2D(k-\) = 

C(k) = 3*-' + 2 k+] + k C(k) -2C(k-\)-6C(k-2) = 2k-l 

Q(k) = 2k + 9 Q(k)-Q(k-\) = 2 

A(k) = k 2 -k A(k)-2A(k-\)+A(k-2) = 2 

B(k) = 2k 2 + l B(k)-2B(k-\) + B(k-2) = A 

G(k) = 2-4* - 5 (-3)* G(k) - G(k - 1) + \2G(k - 2) = 

J{k) = (3 + k) 2 k J(k)-4J(k- \)+4J(k-2) = 

Table 8.3.1 

Recurrence Relation Obtained from Certain Sequences 

Definition: Homogeneous Recurrence Relation. An n' h order linear relation is homogeneous if f (k) = for all k. For each recur- 
rence relation S(k) + CjS(k - 1) +...+ C n S(k — n) = f(k), the associated homogeneous relation is 
S(k) + C,S(k - 1) +...+ C„S(k-n) =0 

Example 8.3.4. D(k) - 2D (k - 1) = is a first-order homogeneous relation. Since it can also be written as D (k) = 2D(k— 1), it 
should be no surprise that it arose from an expression that involves powers of 2 (see Example 8.3.3a). More generally, you would expect that 

the solution of L(k) - aL(k - 1) would involve a k . Actually, the solution is L(k) = L(0)a k , where the value of L(0) is given by the the 
initial condition. 

Example 8.3.5. Consider the second-order homogeneous relation S(k) — 7S(k - 1) + 12 S(k- 2) = together with the initial 

conditions S(0) =4 and 5(1) = 4. From our discussion above, we can predict that the solution to this relation involves terms of the form 

b a k , where b and a are nonzero constants that must be determined . If the solution were to equal this quantity exactly , then 

S(k) = ba k 
S(k- l) = ba k -' 
S(k -2) = b a k - 2 
Substitute these expressons into the recurrence relation to get 

ba k -lba k - 1 + \2ba k ^ =0 (Eq8.3a) 

Each term on the left-hand side of the equation has a factor of b a k ~ 2 , which is nonzero. Dividing through by this common factor yields 
a 2 -la + 12 = (a - 3) (a - 4) = 0. (Eq 8.3b) 

Therefore, the only possible values of a are 3 and 4. Equation (8.3b) is called the characteristic equation of the recurrence relation. The fact is 

that our original recurrence relation is true for any sequence of the form S(k) = b\ 3 k + b-i 4 k , where b\ and b^ are real numbers. This set of 
sequences is called the general solution of the recurrence relation. If we didn't have initial conditions for 5, we would stop here. The initial 
conditions make it possible for us to obtain definite values for b\ and b^- 

, 5(0) =4 \ ib 1 3 + b 2 4 =4^ i bi+b 2 =4 
\5(l)=4l ^\b ] 3 l +b 2 4 ] =4> ^\3b l+ 4b 2 =4 

The solution of this set of simultaneous equations is by = 12 and b 2 = -8 and so the solution is 5 (k) = 12 3* - 8 4*. 

Definition: Characteristic Equation. The characteristic equation of the homogeneous n' h order linear relation 
S(k) + C[S(k — 1) +...+ C„S(k — n) =0 is the nth degree polynomial equation 

a" + zZCja"-J = a"+ da" -1 + ••• +C„_!X + C„=0 

7=1 
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The left-hand side of this equation is called the characteristic polynomial . 
Example 8.3.6. 

(a) The characteristic equation of F (k) - F (k - 1) - F (k - 2) = is a 2 - a - 1 = 0. 

(b) The characteristic equation of Q (k) + 2 Q (k - 1) - 3 Q (k - 2) - 6 Q(k - 4) = is a 4 + 2 a 3 - 3 a 2 - 6 = 0. Note that the 
absence of a Q (k - 3) term means that there is not an x 4 ~ 3 = x term appearing in the characteristic equation. 

Algorithm 8.3.1: Algorithm for Solving Homogeneous n' h Order Linear Relations. 

(a) Write out the characteristic equation of the relation S(k) + C\S(k - 1) +...+ C„S(k — n) = 0, which is 
a" + C,a"-' + ■■■+C n _,x+C n =0. 

(b) Find all roots of the characteristic equation, called characteristic roots. 

(c) If there are n distinct characteristic roots, a h a 2 , ..., a n , then the general solution of the recurrence relation is 
S(k) = bj a k + b 2 a 2 k + ■■■ +b n a n k . If there are fewer than n characteristic roots, then at least one root is a multiple root. If aj is a double 
root, then the bj a/ term is replaced with (* + b, l kj a k . In general, if aj is a root of multiplicity p, then the bj a/ term is replaced with 
(b j0 +b jl k + ■■■ +b Ap _ 1) kP-')a j k . 

(d) If n initial conditions are given, obtain n linear equations in n unknowns (the bj' s from Step (c)) by substitution. If possible, solve 
these equations to determine a final form for S(k). 

Although this algorithm is valid for all values of n, there are limits to the size of n for which the algorithm is feasible. Using just a pencil and 
paper, we can always solve second-order equations. The quadratic formula for the roots of ax 2 +bx + c = 0is 

-b±\ b 2 -4ac 

X = 

2a 

The solutions of a 2 + Ci a + C 2 = are then 



•f-C 1 + Vci 2 -4C 2 ) and \ (-C, - ^ d 2 -4C 2 



(b) The characteristic roots are- (7 + V 49 - 40 I = 5 and - (7 - V 49 - 40 ) = 2. These roots can be just as easily obtained by factoring the 



Although cubic and quartic formulas exist, they are too lengthy to introduce here. For this reason, the only higher-order relations (n > 3) that 
you could be expected to solve by hand are ones for which there is an easy factorization of the characteristic polynomial. 

Example 8.3.7. Suppose that T is defined by T(k) =lT(k-\)- 10 T(k - 2), with , T (0) = 4 and T (1) = 17. We can solve this 
recurrence relation with Algorithm 8.3.1: 

(a) Note that we had written the recurrence relation in "nonstandard" form. To avoid errors in this easy step, you might consider a rearrange- 
ment of the equation to, in this case, T(k) - 7 T(k - 1) + 10 T(k - 2) = 0. Therefore, the characteristic equation is a 2 - 1 a + 10 = 0. 

l 

'2 

characteristic polynomial into (a - 5) (a - 2). 

(c) The general solution of the recurrence relation is T(k) = b{ 2 k + b 2 5 k , 

| T(0)=4 a |fo 1 2° + fo 2 5°=4| , b i+ b 2 =4 
{> I 7X1) = 17/ ^lfo,2 1 +fo 2 5 1 =4/ ^(2b ]+ 5b 2 = n 

The simulations equations have the solution by = 1 and b 2 = 3, Therefore, T(k) = 2 k + 3-5*. 

Here is one rule that might come in handy: If the coefficients of the characteristic polynomial are all integers, with the constant term equal to 
m, then the only possible rational characteristic roots are divisors of m (both positive and negative). 

With the aid of a computer (or possibly only a calculator), we can increase n. Approximations of the characteristic roots can be obtained by 
any of several well-known methods , some of which are part of standard software packages . There is no general rule that specifies the values 
of n for which numerical approximations will be feasible. The accuracy that you get will depend on the relation that you try to solve. (See 
Exercise 17 of this section.) 

Example 8.3.8. Solve S (k) - 1 S(k - 2) + 6S(k - 3) = 0, where 5(0) =8,5(1) = 6,and5(2) = 22. 

(a) The characteristic equation isa 3 -7a + 6 = 0. 

(b) The only rational roots that we can attempt are ± 1, ±2, ±3, and ± 6. By checking these, we obtain the three roots 1, 2, and —3. 

(c) The general solution is 5(£) = b\ \ k + b 2 2 k + b 3 (-3) k . The first term can simply be written b\ . 

5(0) = 8 b, + b 2 + b, = 8 

(d) { 5(1) = 6 | =* | *i +2*2 -3*3=6 

5(20 = 22 *!+4* 2 + 9*3=22 
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You can solve this system by elimination to obtain b\ = 5,b2 = 2, and b^ = 1 . Therefore, 

S(k) = 5 + 2-2* + (-3)* = 5 + 2* +1 + (-3)* 
Example 8.3.9. Solve D (k) - %D(k - I) + 16 D(k - 2) = 0, where D (2) = 16 and D (3) = 80. 

(a) Characteristic equation; a 2 - 8 a + 16 = 0. 

(b) a 2 - 8 a + 16 = (a - 4) 2 . Therefore, there is a double characteristic root, 4. 

(c) General solution: D (k) = (b w + b n k) 4*. 

|£>(2) = 16| r(b l0 + b n 2)4 2 = l6^ t 16b l0 + 32b n = 16 ^ r b >° = J 
C ; lD(3) = 80/ ^l(fo 10 + fo u 3)4 3 = 80/ => l64Z> 10 + i92Z> n = 80/ "* U = I 

11 4 

Therefore D (k) = (1/2 + (1/4)*) 4* = (2 + fc)4* -1 . 

SOLUTION OF NONHOMOGENEOUS FINITE ORDER LINEAR RELATIONS 

Our algorithm for nonhomogeneous relations will not be as complete as for the homogeneous case. This is due to the fact that different right- 
hand sides (f(k)'s) call for different procedures in obtaining a particular solution in Steps (b) and (c). 

Algorithm 83.2: Algorithm for Solving Nonhomogeneous Finite Order Linear Relations. 

To solve the recurrence relation S (k) + Cj S(k - 1) + ... + C n S(k — ri) = f(k): 

(a) Write the associated homogeneous relation and find its general solution (Steps (a) through (c) of Algorithm 8.3.1). Call this the homoge- 
neous solution, S^Xk). 

(b) Start to obtain what is called a particular solution, S^'Qc) of the recurrence relation by taking an educated guess at the form of a 
particular solu tion. For a large class of right-hand sides, this is not really a guess, since the particular solution is often the same type of 
function as f (k) (see Table 8.3.2). 

Right Hand Side, f(k) Form of a particular Solution, 5 <p) (^) 

constant, q constant, d 

linear function q + qjk linear function d + d t k 

m' h degree polynomial, m' h degree polynomial, 

q +q t k + ■■■ +q m k m d +d t k + ■■■ +d m k m 

exponential function q a k exponential function d a k 

Table 8.3.2 

Particular Solutions for Given Right-hand Sides 

(c) Substitute your guess from Step (b) into the recurrence relation. If you made a good guess, you should be able to determine the unknown 
coefficients of your guess. If you made a wrong guess, it should be apparent from the result of this substitution, so go back to Step (b). 

(d) The general solution of the recurrence relation is the sum of the homogeneous and particular solutions. If no conditions are given, then 
you are finished. If n initial conditions are given, they will translate to n linear equations in n unknowns and solve the system, if possible, to 
get a complete solution. 

Example 8.3.10. Solve S(k) + 5 S(k - 1) = 9, with 5(0) = 6. 

(a) The associated homogeneous relation, S(k) + 5 S(k — 1) = has the characteristic equation a + 5 = 0; therefore, a = -5. The 
homogeneous solution is S m (lc) = b (-5) k . 

(b) Since the right-hand side is a constant, we guess that the particular solution will be a constant, d. 

(c) If we substitute S (p \k) = d into the recurrence relation, we get d + 5d = 9,or6rf = 9. Therefore, S (p) (k) = 1.5 

(d) The general solution of the recurrence relation is 

S(k) = S ih \k) + S ip \k) =b(-5f + 1.5 
The initial condition will give us one equation to solve in order to determine b. 

5(0) = 6 => b(-5)° + 1.5 = 6 => b + 1.5 = 6 

Therefore,/; = 4.5 and 5 (k) = 4.5 (-5)* +1.5. 

Example 83.11. Consider T(k) - 7 T(k - 1) + 10T(k - 2) = 6 + 8fcwithr(0) = landT(l) = 2. 

(a) From Example 8.3.7, we know that T lf,) (k) = b\ 2 k + b 2 5 k . Caution: Don't apply the initial conditions to T <h) until you add T <p) ! 

(b) Since the right-hand side is a linear polynomial, T {p) is linear; that is, T (p \k) = d + d x k. 
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(c) Substitution into the recurrence relation yields: 
(d + dik)-l (do + d,(k- 1)) + 10 (do + d x (k - 2)) = 6 + 8 k 

=> (Ado- ttd x )+ (4d\)k = 6 + Sk 
Two polynomials are equal only if their coefficients are equal. Therefore, 

4rf -13rfi=6-i td = $ 
4 d\ = 8 / ^ I d\ = 2 . 

(d) Use the general solution 7"(£) = b\ 2 k + b 2 5 k + 8 + 2 £ and the initial conditions to get a final solution: 

7X0) =1-1 i b x + b 2 + 8 = 1 
7X1) = 2/ ^\2b [ +5b 2 + 10 = 2 
b { + b 2 = -7 
2b { +5b 2 = -S. 
b, =-9 
b 2 = 2 

Therefore, T(k) = -9-2* + 2-5* + 8 + 2 k 

A quick note on interest rates: When a quantity, such as a savings account balance, is increased by some fixed percent, it is most 
easily computed with a multipier. In the case of an 8% increase, the multier is 1 .08 because any original amount A, has 0.08 A added to it, so 
that the new balance is 

A + 0.08A = (1 + 0.08)A = 1.08A. 

Another example is that if the interest rate is 3 .5 % , the multiplier would be 1 .035 . This presumes that the interest is applied a the end of year 
for 3.5% annual interest, often called simple interest. If the interest is applied monthly, and we assume a simplifed case where each month 

has the same length, the multiplier after every month would be 1 1 + ^—\ « 1 .0292. After a year passes, this multiplier would be applied 12 
times, which is the same as multiplying by 1 .0292 12 ~ 1 .3556. That increase from 1 .035 to 1 .3556 is the effect of compound interest. 

Example 8.3.12. Suppose you open a savings account that pays an annual interest rate of 8%. In addition, suppose you decide to deposit 
one dollar when you open the account, and you intend to double your deposit each year. Let B (k) be your balance after k years. B can be 

described by the relation B (k) = 1.08 B (k - 1) + 2*, with 5(0) = 1 . If , instead of doubling the deposit each year, you deposited a constant 

amount, q, the 2 k term would be replaced with q, A sequence of regular deposits such as this is called an annuity. 

Returning to the original situation, we can obtain a closed form expression for B'*': 

(a) B {h \k) = b l (imf 

(b) B { i>\k) should be of the form d 2 k . 

(c) d2 k = 1.08 d 2 k -' +2 k 

=> (2d)2 k - 1 = 1.08rf2*- 1 + 2-2*- 1 

=> 2d = 1.08 d + 2 

=> .92 d = 2 

=> d= 2.174 (to the nearest thousandth) 

Therefore B^\k) = 2.174- 2* 

(d) B(0) = 1 =* b-i +2.174 = 1 

=> bi= -1.174 

B(k) = -1.174- 1.08*+ 2.174-2*. 

Example 8.3.13. Find the general solution to S (k) - 3S(k - 1) - 4S(k - 2) = 4 k . 

(a) The characteristic roots of the associated homogeneous relation are -1 and 4. Therefore, 5 <A) (^) = b\(-Y) k + b 2 4 k . 

(b) A function of the form d\ k will not be a particular solution of the nonhomogeneous relation since it solves the associated homogeneous 
relation. When the right-hand side involves an exponential function with a base that equals a characteristic root, you should multiply your 

guess at a particular solution by k. Our guess at S <p) (£) would then be d k 4 k . See below for a more complete description of this procedure. 

(c) Substitute dk\ k into the recurrence relation for S (k): 

dk4 k -3d(k- 1)4*-' - 4 d (k - 2) 4 k - 2 = 4 k 

I6dk4 k - 2 - \2d(k- l)4 k - 2 -4d(k-2)4 k - 2 =4 k 
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Each term on the left-hand side has a factor of 4 

I6dk - \2d(k - 1) - Ad(k - 2) = 4 2 
20d =16 => rf = 0.8 

Therefore, S {p) (k) = 0.8 * 4 4 

(d) The general solution to the recurrence relation is 

S(k) =bi(-l) k + b 2 A k + 0.8 £4* 

BASE OF RIGHT-HAND SIDE EQUAL TO CHARACTERISTIC ROOT 

If the right-hand side of a nonhomogeneous relation involves an exponential with base a, and a is also a characteristic root of multiplicity p, 
then multiply your guess at a particular solution as prescribed in Table 8,3-2 by k p , where k is the index of the sequence. 
Example 8.3.14. 

(a) \iS(k) - 9S(k - 1) + 20S{k - 2) = 2-5*, the characteristic roots are 4 and 5. S ip \k) will take the fornidk5 k . 

(b) If S(ri) — 6 S(n — 1) + 9 S{n - 2) = 3" +1 the only characteristic root is 3, but it is a double root (multiplicity 2). Therefore, the form 
of the particular solution is dn 2 3". 

(c) If Q(j) - Q(j - 1)- 12 Q(j - 2) = (-3y + 6-4 ; , the characteristic roots are -3 and 4. The form of the particular solution will be 
dlj( - 3 y+ d 2 j. 4 j, 

(d) If S(k) - 9S(k - 1) + 8 S(k - 2) = 9 k + 1 = (9 k + 1)1* , the characteristic roots are 1 and 8. If the right-hand side is a polyno- 
mial, as it is in this case, then the exponential factor 1* can be introduced. The particular solution will take the form k(do + d\ k). 

We conclude this section with a comment on the situation in which the characteristic equation gives rise to complex roots . If we restrict the 
coefficients of our finite order linear relations to real numbers , or even to integers , we can still encounter characteristic equations whose roots 
are complex. Here, we will simply take the time to point out that our algorithms are still valid with complex characteristic roots, but the 
customary method for expressing the solutions of these relations is different. Since an understanding of these representations requires some 
background in complex numbers, we will simply suggest that an interested reader can refer to a more advanced treatment of recurrence 
relations (see also difference equations) . 

EXERCISES FOR SECTION 8.3 
A Exercises 

Solve the following sets of recurrence relations and initial conditions: 

1. S(k) - \0S(k - 1) + 9S(k - 2) = 0, 5(0) = 3, 5(1) = 11 

2. S(k) - 9S(k - 1) + lSS(k - 2) = 5(0) = 0, 5(1) = 3 

3. S(k) - 0.25 S(k - 1) = 0, 5(0) = 6 

4. S(k) - 2QS(k - 1) + lQ0S(k - 2) = 0, 5(0) = 2, 5(1) = 50 

5. S(k) - 2S(k - 1) + S(k - 2) = 2 5(0) = 25, 5(1) = 16 

6. S(k) - S(k - 1) - 6S(k - 2) = -30 5(0) = 7, 5(1) = 10 

7. S{k) - 5S(k - 1) = 5*, 5(0) = 3 

8. S(k) - 5S(k - 1) + 6S(k - 2) = 2, 5(0) = -1, 5(1) = 

9. S(k) - 4S(k - 1) + 4S(k - 2) = 3k + 2 k . 5(0) = 1, 5(1) = 1 

10. S(k) = rS(k - 1) + a , 5(0) = 0, r, a > 0, r * 1 

11. S(k) - 4S(k - 1) - HS(k- 2)+ 3QS(k - 3) = 0, 
5(0) = 0, 5(1) = -35, 5(2) = -85 

12. Find a closed form expression for P (k) in Exercise 3 of Section 8.2. 

13. (a) Find a closed form expression for the terms of the Fibonacci sequence (see Example 8.1 .4). 

(b) The sequence C was defined by C r = the number of strings of zeros and ones with length r having no consecutive zeros (Example 
8.2.1(c)). Its recurrence, relation is the same as that of the Fibonacci sequence. Determine a closed form expression for C, , r > 1 , 

n 

14. If S(n) = X g(j), n > 1 , then 5 can be described with the recurrence relation S(ri) = S(n - 1) + g (n). For each of the following sequences 

7=1 

that are defined using a summation, find a closed form expression: 
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(a) S(n) = £ j, »>1 

7=1 

(b) G(») = £/. «l 

7=1 

(c) P(n) = E(;) J , «>0 

7=1 

(d) r(n)= £/, n>\ 

7=1 

B Exercises 

15. Let D («) be the number of ways that the set { 1 , 2, . .., «),« > 1 , can be partitioned into two nonempty subsets . 

(a) Find a recurrence relation for D. (Hint: It will be a first-order linear relation.) 

(b) Solve the recurrence relation. 

16. If you were to deposit a certain amount of money at the end of each year for a number of years, this sequence of payment would be called 
an annuity (see Example 8.3.12,). 

(a) Find a closed form expression for the balance or value of an annuity that consists of payments of q dollars at a rate of interest of i. Note 
that for a normal annuity, the first payment is made after one year. 

(b) With an interest rate of 12.5%, how much would you need to deposit into an annuity to have a value of one million dollars after 18 years? 

(c) The payment of a loan is a form of annuity in which the initial value is some negative amount (the amount of the loan) and the annuity 
ends when the value is raised to zero. How much could you borrow if you can afford to pay $5,000 per year for 25 years at 14% interest? 

C Exercises 

17. Suppose that C is a small positive number. Consider the recurrence relation B (k) - 2B(k - 1) + (l - C 2 )B(k — 2) = C 2 , with initial 

conditions B (0) = 1 and 6(1) = 1 . If C is small enough, we might consider approximating the relation by replacing 1 - C 2 with 1 and C 2 with 
0. Solve the original relation and its approximation. Let B a a be the solution of the approximation. Compare closed form expressions for B (k) 
and B a (k). Their forms are very different because the characteristic roots of the original relation were close together and the approximation 
resulted in one double characteristic root. If characteristic roots of a relation are relatively far apart, this problem will not occur. For example, 
compare the general solutions of 

S(k) + 1.001S(* - 1) - 2.004002 S(k - 2) = 0.0001 and 
S a (k) + S a (k - 1) - 2S a (k - 2) = 0. 
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I 8.4 Some Common Recurrence Relations 



In this section we intend to examine a variety of recurrence relations that are not finite-order linear with constant coefficients . For each part of 
this section, we will consider a concrete example, present a solution, and, if possible, examine a more general form of the original relation. 

Example 8.4.1. Consider the homogeneous first-order linear relation S(n) - nS(n - 1) = 0, n > 1, with initial condition 5(0) = 1. 
Upon close examination of this relation, we see that the nth term is n times the (n - l) st term, which is a property of n factorial. S (n) = n ! is 
a solution of this relation, for if n > 1 , 

S(n) = n\ = n -(n — 1)! = n- S(n — 1). 

In addition, since 0! = 1, the initial condition is satisfied. It should be pointed out that from a computational point of view, our "solution" 
really isn't much of an improvement since the exact calculation of n ! takes n - 1 multiplications. 

If we examine a similar relation, G{k) - 2 k G (k - 1), k > 1 with G (0) = 1 , a table of values for G suggests a possible solution: 
k 12 3 4 5 



G(k) 1 2 2 3 2 6 2 10 2 15 

The exponent of 2 in G(k) is growing according to the relation E(k) = E(k - 1) + k, with E(Q) = 0. Thus E(k)= and 

G (k) = 2 klk+v,/2 . Note that G (k) could also be written as 2° 2 1 2 2 • • -2 k , for k > 0, but this is not a closed form expression. 

In general, the relation P (n) = f (n) P{n — 1) for n > 1 with P (0) = / (0), where/is a function that is defined for all n > 0, has the "solution" 

P(n)= flfik) 

k=0 

This product form of P (n) is not a closed form expression because as n grows, the number of multiplications grow. Thus, it is really not a true 
solution. Often, as for G (k) above, a closed form expression can be obtained from the product form. 

Example 8.4.2. Analysis of a Binary Search Algorithm. Suppose you intend to use a binary search algorithm (see Example 8.1 .3) on 
lists of zero or more sorted items, and that the items are stored in an array, so that you have easy access to each item. A natural question to 
ask is "How much time will it take to complete the search?" When a question like this is asked, the time we refer to is often the so - called 
worst - case time. That is, if we were to search through n items, what is the longest amount of time that we will need to complete the search? 
In order to make an analysis such as this independent of the computer to be used, time is measured by counting the number of steps that are 
executed. Each step (or sequence of steps) is assigned an absolute time, or weight; therefore, our answer will not be in seconds, but in 
absolute time units. If the steps in two different algorithms are assigned weights that are consistent, then analyses of the algorithms can be 
used to compare their relative efficiencies . There are two major steps that must be executed in a call of the binary search algorithm: 

(1) If the lower index is less than or equal to the upper index, then the middle of the list is located and its key is compared to the value that 
you are searching for. 

(2) In the worst case, the algorithm must be executed with a list that is roughly half as large as in the previous execution. If we assume that 
Step 1 takes one time unit and T (n) is the worst - case time for a list of n items , then 

T(n)= 1 + T(\nl2\) for n > (8.4 a) 

For simplicity, we will assume that 

T(0) =0 (8.4 b) 

even though the conditions of Step 1 must be evaluated as false if n = 0. You might wonder why n/2 is truncated in 8.4a. If n is odd, then 
n = 2 k + 1 for some k > 0, the middle of the list will be the (k + l) st item, and no matter what half of the list the search is directed to, the 
reduced list will have k = |_«/2J items. On the other hand, if n is even, then n = 2 k for k > . The middle of the list will be the k' h item, and 

the worst case will occur if we are directed to the k items that come after the middle (the (k + l) st through (2 kf h items). Again the reduced 
list has |_«/2J items. 

Solution to 8.4 a and 8.4 b. To determine T (n), the easiest case is when n is a power of two. If we compute T (2 m ), m > , by iteration, our 
results are 

T(\) = 1 + T(0) = 1 

T(2) = 1 + T(l) = 2 

T(4) = 1 + T{2) = 3 

T(8) = 1 + 7" (4) = 4. 

The pattern that is established makes it clear that T(2 m ) = m + 1 . This result would seem to indicate that every time you double the size of 
your list, the search time increases by only one unit. 

A more complete solution can be obtained if we represent n in binary form. For each n > 1 , there exists a non - negative integer r such that 

2'-'<n<2'~ (8.4 c) 

For example, if n = 21, 2 4 < 21 < 2 5 ; therefore, r = 5. If n satisfies (8.4c), its binary representation requires r digits. For example, 21 ten = 
10101 two . 

In general, n = («i «2 ■■■ a r)two- where a\ = 1. Note that in this form, Yn/2\ is easy to describe: it is the r — 1 digit binary number 

(«1 O-T. ■■■ °r-l)two 
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Therefore, 

T (n) = T(ai ci2 ... a r ) 

= 1 + T(ai ai ... a r -\) 

= 1 + ( 1 + T(a i ci2 ... a r -2)) 

= 2 + T(ai a^ ... a r -2) 

= (r-1) + r(aj) 

= (r- 1)+ 1 since r(l)= 1 

= r 

From the pattern that we've just established, T (n) reduces to r. A formal inductive proof of this statement is possible. However, we expect 
that most readers would be satisfied with the argument above. Any skeptics are invited to provide the inductive proof. If n = 21 : 

T(2\) = 7X10101) 

= 1 + 7X1010) 

= i + (i + r(ioi)) 
= l + (i + (i + r(io») 

= 1 + (1 + (1 + (1 + T(\)))) 

= 1 + (1 + (1 + (1 + (1 + 7X0))))) 

= 5 

Conclusion; The solution to 8.4 a and 8.4b is that for n > 1 , T (n) = r, where 2 r ~' <n <2 r . 

A less cumbersome statement of this fact is that T(n) = |_log 2 «J + 1. For example, T(2\) = Llog 2 21J + 1=4+1=5 . 

REVIEW OF LOGARITHMS 

Any discussion of logarithms must start by establishing a base, which can be any positive number other than 1 . With the exception of Theorem 
8.4.1, our base will be 2. We will see that the use of a different base (10 and e ~ 2.171828 are the other common ones) only has the effect of 
multiplying each logarithm by a constant. Therefore, the base that you use really isn't very important. Our choice of base 2 logarithms is 
convenient for the problems that we are considering. 

The base 2 logarithm of a positive number represents an exponent and is defined by the following equivalence for any positive real number a. 

log 2 a = x <=> 2* = a . 

For example, log 2 8 = 3 because 2 3 = 8 and log 2 1 .414 as 0.5 because 2° 5 ~ 1.414 .A graph of the function / (x) = log 2 x in Figure 8.4.1a 
shows that if a < b, the log 2 a < log 2 b; that is, when x increases, log 2 x also increases. However, if we move x from 2 10 = 1024 to 
2" = 2048, log 2 x only increases from 10 to 11. This slow rate of increase of the logarithm function is an important point to remember. An 
algorithm acting on n pieces of data that can be executed in log 2 n time units can handle significantly larger sets of data than an algorithm that 

can be executed in nj 100 or even yn time units (see Figure 8.4.1b). The graph of T(n) = |_log 2 n\ + 1 would show the same behavior. 
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Figure 8.4.1 Log graphs 

A few more properties that we will use in subsequent discussions involving logarithms are summarized in the following theorem. 
Theorem 8.4.1. Fundamental Properties of Logarithms. Let a and b be positive real numbers, and r a real number. 

log 2 l = (8.4 d) 

\o% 2 ab = log 2 a + log, b (8.4 e) 



lo 82 1 = lo &2 a - lo g2 b 
log 2 a r = r log, a 

2 lo6 2 " = a 



(8.4f) 
(8.4 g) 
(8.4h) 



Returning to the binary search algorithm, we can derive the final expression for T (n) using the properties of logarithms, including that the 
logarithm function is increasing so that inequalities are maintained when taking logarithms of numbers . 

T(n) = r a 2'- 1 <n<2' 

<=> log 2 2 r ~' < log 2 n < log 2 2'' 

<=> r - 1 < log 2 n < r 

<=> r - 1 = |_log 2 n\ 

«=> T(n) = r = Llog 2 n\ + 1 

We can apply several of these properties of logarithms to get an alternate expression for T(n): 

Llog 2 n\ + \ = Llog 2 n + 1J 

= Uog 2 n + log 2 2J 
= Llog 2 2 « J 
Definition: Logarithms base b, 1} b > 0, b ±\, then for a >0, 



log b a 



V = a 
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log a 

Theorem 8.4.2. Let b > 0, b #1. Then for all a > 0, log b a = . Therefore, ifb > 1 , base b logarithms can be obtained from base 2 

logarithms by dividing by the positive scaling factor log 2 b. Ifb < 1, this scaling factor is negative. 

Proof: By an analogue of 8.4f, a = fo log * a . Therefore, if we take the base 2 logarithm of both sides of this equality we obtain: 

log 2 a = log 2 (6 10 ^ a ) 
=> log 2 a = log,, a log 2 b 
To obtain the desired result, divide both sides of this last equation by log 2 b. ■ 

Note:log 2 10 « 3.32192 and log 2 e = 1.55269. 

If the time that was assigned to Step 1 of the binary search algorithm is changed, we wouldn't expect the form of the solution to be very 
different. If T(n) = a + T({n/2 J) with T (0) = c,thenT(n) = c + a|_log 2 2«J. 

A further generalization would be to add a coefficient to T ([n/2 J): T(n) = a + bT ([n/2 J) with T (0) = c, where a, b, c e R, and b 4- is 
not quite as simple to derive. First, if we consider values of n that are powers of 2: 

7X1) = a + bT(0) = a + be 

7X2) = a + b{a + b c) = a + ab + cb 2 

T(4) = a + b(a + ab + cb 2 ) = a + ab + ab 2 + c b 3 

T(2 r ) = a + ab + ab 2 + ■■■ + ab' + cb r+1 
If n is not a power of 2, by reasoning that is identical to what we used to solve 8.4a and 8.4b, 

T(n) = Y.ab k + cb r+[ 

where r = U°g2 n \ ■ 

The first term of this expression can be written in closed form. Let x be that sum: 

x = a + ab + ab 2 + ■■■ + ab T 
bx= ab + ab 2 + ■■■ + ab' + ab r+{ 

We've multiplied each term of x by b and aligned the identical terms in x and bx. Now if we subtract the two equations, 

x - b x = a - ab 
x{\ -b) = a(\ -b r+1 ) 
Therefore, 

b'* { -\ 

x = a 

6-1 



A closed form expression for T (n) is 

T(n) = a + c b' + ' where r - |_log 2 ij 

Example 8.4.3. The efficiency of any search algorithm such as the binary search relies on fact that the search list is sorted according to a key 
value and that the search is based on the key value. There are several methods for sorting a list. One example is the bubble sort. You might be 
familiar with this one since it is a popular "first sorting algorithm." A time analysis of the algorithm shows that if B (n) is the worst-case time 
needed to complete the bubble sort on n items, then B(n) = (n - 1) + B(n - 1) and B(l) = 0. The solution of this relation is a quadratic 

function B(n) = - (n 2 - n). The growth rate of a quadratic function such as this one is controlled by its squared term. Any other terms are 

dwarfed by it as n gets large. For the bubble sort, this means that if we double the size of the list that we are to sort, n changes to 2 n and so n 2 
becomes 4n . Therefore, the time needed to do a bubble sort is quadrupled. One alternative to bubble sort is the merge sort. Here is a simple 
version of this algorithm for sorting F = {r(l), r(2), . . . , r(n)), n > 1 . If n = 1 , the list is sorted trivially. If n > 2 then: 

(1) Divide F into F t = (r(l), ■■■, r(|n/2J)} and F 2 = {r(|n/2J + 1), ..., r(n)). 

(2) Sort Fi and F 2 using a merge sort. 

(3) Merge the sorted lists F\ and F 2 into one sorted list. If the sort is to be done in descending order of key values, you continue to choose the 
higher key value from the fronts of F[ and F 2 and place them in the back of F. 

Note that F, will always have [n/2] items and F 2 will have [n/2~\ items; thus, if n is odd, F 2 gets one more item than Fj . We will assume that 
the time required to perform Step 1 of the algorithm is insignificant compared to the other steps; therefore, we will assign a time value of zero 
to this step. Step 3 requires roughly n comparisons and n movements of items from F[ and F 2 to F; thus, its time is proportional to n. For this 
reason, we will assume that Step 3 takes n time units. Since Step 2 requires T(L«/2J) + T(\n/2~]) time units, 

T(n) = n + nin/21) + r(fn/21) (8.4i) 
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with the initial condition 

7X1) = 0. (8.4j) 

Instead of an exact solution of 8.4i and 8.4j, we will be content with an estimate for T(n). First, consider the case of n = 2 r ,r > 1: 

T(2 l ) = T(2) = 2 + 7X1) + 7X1) = 2 = 1 ■ 2 
T(2 2 ) = 7X4) = 4 + 7X2) + 7X2) = 8 = 2-4 
T(2 3 ) = 7(8) = 8 + 7(4) + 7(4) = 24 = 3 • 8 



T(2 r ) = r2 r = 2 r log 2 2 r 

Thus , if n is a power of 2 , 7 (n) = « log 2 n . Now if, for some r > 2 , 2 r ~ ' < n < 2 r , then (r - 1 ) 2 r ~ ' < T(n) < r 2 T . This can be proven by 
induction on r. As n increases from 2 to 2' ', 7(n) increases from (r - l)2 r ~'to r2 r and is slightly larger than L"'°g2 n \- The discrepancy is 
small enough so that T e (n) = \n log 2 n\ can be considered a solution of 8.4i and 8.4j for the purposes of comparing the merge sort with other 
algorithms. Table 8.4.1 compares B (n) with T e (n) for selected values of n. 

Table 8.4.1 
Comparison of Times for Bubble Sort and Merge Sort 



n 


Bin) 


T e in) 


10 


45 


34 


50 


1225 


283 


100 


4950 


665 


500 


124750 


4483 


1000 


499 500 


9966 



Definition: Derangement. A derangement of a set A is a permutation of A (i.e., a bijection from A into A) such that f(a) j= a for all 
a e A. 

Example 8.4.4. If A = {1, 2, . . . , n), an interesting question might be "How many derangements are there of A?" We know that our answer is 
bounded above by n !. We can also expect our answer to be quite a bit smaller than n ! since n is the image of itself for (n - 1) ! of the permuta- 
tions of A. 

Let D (n) be the number of derangements of {1, 2, . . . , n). Our answer will come from discovering a recurrence relation on D. Suppose that 
n > 3. If we are to construct a derangement of {1, 2, ... , n),f, then / (n) = k 4- n. Thus, the image of n can be selected in n - 1 different 
ways. No matter which of the n - 1 choices we make, we can complete the definition of /in one of two ways. First, we can decide to make 
/ (k) = n, leaving D (n — 2) ways of completing the definition of/, since / will be a derangement of { 1 , 2, ..., n) - {n, k). Second, if we 
decide to select file) =/= n, each of the D(n - 1) derangements of {1, 2, ...,«- 1) can be used to define /. If g is a derangement of 
{1, 2, ..., n - 1 ) such that g (p) = k, then define f by 

n if j = p 

f(j) = { k if j = n 

g(j) otherwise 

Note that with our second construction of/ / (/ («)) = f(k) 4- n, while in the first construction, / (/ (n)) = / (k) = n. Therefore, no derange- 
ment of {1, 2, ..., n}with/(n) = k can be constructed by both methods. 

To recap our result, we see that /is determined by first choosing one of n - 1 images of n and then constructing the remainder of /in one of 
D in — 2) + D in - 1) ways. Therefore, 

D(n) = (n - \)(D(n - 2) + D(n - 1)). (8.4k) 

This homogeneous second-order linear relation with variable coefficients, together with the initial conditions D(\) = and D (2) = 1, 
completely defines D. Instead of deriving a solution of this relation by analytical methods, we will give an empirical derivation of an approxima- 
tion of D (n). Since the derangements of { 1 , 2 . . . , n) are drawn from a pool of n ! permutations, we will see what percentage of these permuta- 
tions are derangements by listing the values of n !, D (n), and — — . The results we obtain (see Table 8.4.2) indicate that as n grows, —2- hardly 

changes at all. If this quotient is computed to eight decimal places, for n > 12, D (n)/n ! = 0.36787944. The reciprocal of this number, which 
D (n)/nl seems to be tending toward, is, to eight places, 2.71828182. This number appears in so many places in mathematics that it has its own 

n \ 
name, e. An approximate solution of our recurrence relation on D is then D(n) ~ — 
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n 


D(n) 


D(n)/n! 


1 








2 


1 


0.50000000 


3 


2 


0.33333333 


4 


9 


0.37500000 


5 


44 


0.36666667 


6 


265 


0.36805556 


7 


1854 


0.36785714 


8 


14833 


0.36788194 


9 


133496 


0.36787919 


10 


1334961 


0.36787946 


11 


14684570 


0.36787944 


12 


176214 841 


0.36787944 


13 


2290792932 


0.36787944 


14 


32071101049 


0.36787944 


15 


481066515 734 
Table 8.4.2 


0.36787944 


D (n) compared to 


n\ 



EXERCISES FOR SECTION 8.4 
A Exercises 

1 . Solve the following recurrence relations. Indicate whether your solution is an improvement over iteration. 

(a) nS(n) - Sin - 1) = 0, 5(0) = 1. 

(b) T(k) + 3kT(k - 1) = 0,r(0) = 1. 

(c) U(k) - — U(k-l) = 0,k>2, (7(1) = 1. 

k 

2. Prove that if n >0, \n/2\ + [n/21 = n. (Hint: Consider the cases of n odd and n even separately.) 

B Exercises 

3. Solve as completely as possible: 

(a) T(n) = 3 + r(|n/2J), T(0) = 0. 

(b) T(n) = 1 + \n\nl2\), T(Q) = 2. 

(c) V(n) = 1 + y|n/8J),V(0) = 0. (Hint: Write n in octal form.) 

4. Prove by induction that if T(n) = 1 + T(\nj2 J), T(0) = 0,and2'""' <n<2' ,r > l,thenT(n) = r. (Hint: Prove by induction on r.) 

5. Use the substitution S (n) = T(n + 1)/T(n) to solve 

T(n)T(n-2)-T(nf = 1 for n > 2,withr(0) = l,T(l) = 6,<mdT(n) > 0. 

6. Use the substitution G(n) = T(n) 2 to solve 

T(nf - T(n - l) 2 = 1 for n > 1 , with T (0) = 10. 

7. Solve as completely as possible: 

(a) Q(n) = 1 + e([V^|), n > 2, g(l) = 0. 

(b) R(n) = n +R(\nl2\),n > 1 , /? (0) = 0. 

8. Suppose Step 1 of the merge sort algorithm did take a significant amount of time. Assume it takes 0.1 time unit, independent of the value of 
n: 

(a) Write out a new recurrence relation for T (n) that takes this factor into account; 

(b) Solve for T(2 r ), r > 0; 

(c) Assuming the solution for powers of 2 is a good estimate for all n, compare your result to the solution in the text. As gets large, is there 
really much difference? 
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I 8.5 Generating Functions 



This section contains an introduction to the topic of generating functions and how they are used to solve recurrence relations, among other 
problems. Methods that employ generating functions are based on the concept that you can take a problem involving sequences and translate it 
into a problem involving generating functions. Once you've solved the new problem, a translation back to sequences gives you a solution of the 
original problem. 

This section covers: 

(a) The definition of a generating function; 

(b) Solution of a recurrence relation using generating functions to identify the skills needed to use generating functions; 

(c) An introduction and/or review of the skills identified in point b; 

(d) Some applications of generating functions . 

Definition: Generating Function of a Sequence. The generating function of a sequence S with terms S , S/, S 2 , ... is the infinite sum 
G(S; z) = | S„ z n = S + S l z + S 2 z 2 +S 3 z 3 + --- 

n=0 

The domain and codomain of generating functions will not be of any concern to us since we will only be performing algebraic operations on 
them. 

Example 8.5.1. 

(a) IfS„ = 3",n > 0,then 

G(S;z) = 1 + 3z + 9z 2 + 27 z 3 + ••• 
= 1 3" z" 

ii=0 

= S(3z)" 

ii=0 

We can get a closed form expression for G (5; z) by observing that G(S; z) - 3 z G(5; z) = 1 . Therefore, 

G(S;z)=^- 

(b) Finite sequences have generating functions. For example, the sequence of binomial coefficients C (n; 0), C(n; 1), ...,C(n; n), n > 1 
has generating function 

G(C(n; •); z) = C (n; 0) + C (n; l)z + ••• + C (n; n)z" 
= ZC(n;k)z k 

k=0 
= (1+Z)" 

by application of the binomial formula. 

(c) If<2(n) = n 2 , 

G(Q;z)= 2>V= I A k 2 z k 

n=0 k=0 

Note that the index that is used in the summation has no significance. Also, note that the lower limit of the summation could start at 1 since 

2(0) = o. 

SOLUTION OF A RECURRENCE RELATION USING GENERATING FUNCTIONS 

Problem: Solve S (n) - 2S(n - 1) - 3 S (n - 2) = Q,n > 2, with 5(0) = 3andS(l) = 1. 

(1) Translate the recurrence relation into an equation about generating functions. 

In our example, let V (n) = S(n) - 2S(n - 1) - 3 S(n - 2), n > 2,with\/(0) = OandV(l) = 0. Therefore, 

G(V;z) = + 0z + | (S(n) - 2S(n - 1) - 3S(n - 2))z" = 

n=2 

(2) Solve for the generating function of the unknown sequence, G(S, z) = zZ^nZ". 

n=0 

0= E (S(n) - 2S(n - 1) - 3S(n - 2))z" 

n=2 



E S(n)z" - 2[ 2 S(n - I) A - 3 [ E S(n 

n=2 \n=2 ) \n=2 



■2)z" 

n=2 



Close examination of the three sums above shows: 
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(a) ZS n z"= I5„z" - 5(0)- 5(1) z 

n=2 n=0 

= G(5; z) - 3 - z 



since 5(0) = 3 and 5(1) = 1. 

(b) Z5(n-l)z"=z(i;5(n-l)z" 

n=2 Vi=2 



S%)z") 
n=i / 



z{zS(n)z"-S(0)) 
z(G(5; z) - 3 



(c) 2 S(n - 2) z' ! = z 2 2 S(n - 2) z"- 2 

n=2 Vn=2 

= z 2 G(5; z) 



Therefore, 

(G(5; z) - 3 - z) - 2 z(G(5; z) - 3) - 3 z 2 G(5; z) = 
=> G(5;z)-2zG(5;z)-3z 2 G(5;z) = 3 - 5z 

(3) Determine the sequence whose generating function is the one obtained in Step 2, 

For our example, we need to know one general fact about the closed form expression of an exponential sequence (a proof will be given later): 

T(n) = ba n , n>0 <=i G(T; z) = — (8.5a) 

l-az 

Now, in order to recognize 5 in our example, we must write our closed form expression for G (5; z) as a sum of terms like G (T; z) above. Note 
that the denominator of G (5; z) can be factored: 

U ^' Z >- l- 2z - 3z * "(1-3 Z )(1 +Z ) 

If you look at this last expression for G (5; z) closely, you can imagine how it could be the result of addition of two fractions, 
= + — (8.5b) 

(l-3z)(l+z) 1-3 z l+z 

where A and B are two real numbers that must be determined. Starting on the right of 8.5b, it should be clear that the sum, for any A and B, 
would look like the left-hand side. The process of finding values of A and B that make 8.5b true is called the partial fractions decomposition of 
the left-hand side: 



A B 


A(l+z) S(l-3z) 


1-3 z l+z (1- 


-3z)(l+z) (1-3 z) (l+z) 




(A+B)+(A-3 B) z 




(1-3 z) (l+z) 


Therefore, 




i A+B=3 \ 
I A - 3 B = -5 / 


/ A=1 \ 


and 




G(5;z)==^ 


2 

+ 

l+z 



We can apply 8.5a to each term of G(S;z): 

is the generating function for 5i(n) = 1 • 3" = 3" and 

2 . . 

— is the generating function for 52(«) = 2 (-1)". 

Therefore, S(n) = 3" + 2 (-1)". 

From this example, we see that there are several skills that must be mastered in order to work with generating functions. You must be able to: 

(a) Manipulate summation expressions and their indices (in Step 2). 

Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 United States 



157 



Chapter 8 - Recursion and Recurrence Relations 



157 



(b) Solve algebraic equations and manipulate algebraic expressions, including partial function decompositions (Steps 2 and 3). 

(c) Identify sequences with their generating functions (Steps 1 and 3) . 

We will concentrate on the last skill first, a proficiency in the other skills is a product of doing as many exercises and reading as many examples 
as possible. 

First, we must identify the operations on sequences and on generating functions. 

Operations on Sequences: Let 5 and T be sequences of numbers and let c be a real number. Define the sum 5 + T, the scalar product c 5, the 
product ST, the convolution 5*7% the pop operation 5T (read "S pop"), and the push operation 54- (read "S push") term- wise for k > by 

(S + T)(k) = S(k) + T(k) 
(c S) (k) = c S(k) 
(ST)(k) = S(k)T(k) 

(S*T)(k) = Y,S{j)T{k-j) 
(ST)(k) = S(k+ 1) and 



(Sl)(k): 







ifk = 



S(k-l) if /t > ' 



If one imagines a sequence to be a matrix with one row and an infinite number of columns , S + T and c S are exactly as in matrix addition and 
scalar multiplication. There is no obvious similarity between the other operations and matrix operations. 

The pop and push operations can be understood by imagining a sequence to be an infinite stack of numbers with S (0) at the top, 5(1) next, etc., 
as in Figure 8.5.1a. The sequence 5T is obtained by "popping" S(0) from the stack, leaving a stack as in Figure 8.5.1b, with S(l) at the top, S(2) 
next, etc. The sequence S I is obtained by placing a zero at the top of the stack, resulting in a stack as in Figure 8.5.1c. Keep these figures in 
mind when we discuss the pop and push operations. 



S(0) 




S(l) 







S(l) 


S(2) 


S(0) 


S(2) 


S(3) 


S(D 


S(3) 


S(4) 


S(2) 


S(4) 


S(5) 


S(3) 









(a) (b) (c) 

FIGURE 8.5.1 

Stack interpretation of pop and push operation 

Example 8.5.2. If 5 (n) = n,T(n) = n 2 , U (n) = 2",and/?(n) =n2" , 

(a) (5 + T) (n) = n + n 1 

(b) {U + R) in) = 2" + n 2" = (1 + n) 2" 

(c) (2U)(n) = 2-2"= 2" +1 



(d) C~R)(n)= i -n2"=n2 n 



(e) (5 T) (n) = n n 2 = n 3 2 = n 3 

(f) (5 * T) (n) = | 50) T{n - j) = £ j (n - j) 2 

j=0 >0 

= zh(jn 2 -2nj 2 + f) 

y=0 

n n n 

= n 2 Zj-lnZf + Z./ 3 

7=0 7=0 7=0 



= n 2 {^)-2n{ i2n ^ +l)n )+ y -n 2 (n+l) 2 

by Exercise 14 of Section 8 .3 . 
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(g) (U*U)(n) = ZU(j)U(n-j) 

j=0 

7=0 

= (n+ 1)2" 
(h) (ST) (n) = n + 1 

(i) (5i)(n) = max(0, n - \) 

(])((Sl)l)(n)= max(0, n - 2) 

(k)(f/i)(„) = { 2 * B> ° 
v if « = 

(l)(«n)T)(n) = «7i)(n + l)=2*= C/(n) 
if n = 



(m) ((t/t)A)(fi)-, . 

t/(n) if n > 

Notethat(CM)T*(C/T)J. 

Definition: Multiple Pop and Push: If S is a sequence of numbers, define 
S\p = (ST(p - -/))!(/■ p ^ 2 and St 1 = ST. 
Similarly, define 

Sip = (Slip - l))iif p >2andSU = Si. 
Notice that 

(S12)(k)= ((5tl)T)(Jfc) = ((5T)T)(Jfc)= (ST)(* + 1)= k + 2 
In general, 

(S1>)(*) = S(k + p), and 

Operations on Generating Functions 

if 

Gfe)= 2>*z* and //(z) = £*> t z* 

are generating functions and c is a real number, then the sum G + H, scalar product c G, product GH, and monomial product z p G, p > 1 are 
generating functions , where 

{G + H)(z)= lZ(a k + b k )z k 

k=0 

(cG)(z)= lZca k z k 

k=0 

co k 

(GH)(z) = >Zcz k where c k = 2Zajb k _j 

k=0 1=0 

{ Z p G) (z) = z" E a k z k =Za k z k+l > = | a n - p z" 

k=0 k=0 "=p 

The last sum is obtained by substituting n - p for k in the previous sum. 
Example 8.53. If 

D(z) = \lkz k &MH(z) = lZ2 k z k 

k=0 k=0 

then 
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(D + H)(z) = Z(k + 2 k )z k 

k=0 

(2H)(z) = f l 2-2 k z k = ±Z2 k+l z k 

k=0 k=0 

(zD) (z) = zHkz k = f,k z M = Z (k - \)z k 

k=0 k=0 k=i 

= D(z) - Z z" 

k=\ 



(DH)(z)= Z 

k=0 



Zj2 k -i 

V>0 



(HH)(z)= Z Z2 J 2 H 



k=o U=° 



z k = ±Z(k+l)2 k z k 



Note: D(z) = G(S;z), and //(z) = G(U;z) from Example 8.5.2. Now we establish the connection between the operations on sequences and 
generating functions . Let S and T be sequences and let c be a real number; 



G(S + T;z) = G(S;z) + G(T-z) 

G(c S; z) = c G(S; z) 

G(S*T;z) = G(S;z)G(T;z) 

G(St;z) = (G(S;z)-S(0))/z 

G(Sl;z) = zG(S;z) 



(8.5 c) 
(8.5 d) 
(8.5 e) 

(8.5/) 
(8.5 g) 

In words, 8.5c says that the generating function of the sum of two sequences equals the sum of the generating functions of those sequences. 
Take the time to write out the other four identities in your own words. From the previous examples, these identities should be fairly obvious, 
with the possible exception of the last two. We will prove 8.5f as part of the next theorem and leave the proof of 8.5g to the interested reader. 
Note that there is no operation on generating functions that is related to sequence multiplication; that is, G (S T; z) cannot be simplified. 

Theorem 8.5.1. If p > 1, 



(a) G(.Slp;z) = \G{S;z) -"lZ S(k)z k )/ z k 

(b) G(Slp;z) =z"G(S;z). 

Proof of Part (a): The proof is by induction. Basis: 



G(ST,z)= zZS(k+l)z k = ZS(k)z k 

k=0 k=l 



\lZS(k)z k 



= \S(0)+ lZS(k)z k -S(0)Uz 

= (G(5;z)-5(0))/z 
Therefore, part (a) is true for p = 1 . 

Induction, Suppose that for some p > 1 , the statement in part (a) is true: 

G(5T(p + D;z)= G((STp)T;z) 

= (G(5Tp;z)-(5fp)(0))/z bythebasis 



Gffi!)-ES(«z» 



S(P) 



by the induction hypothesis. Now write S(p) in the last expression above as (S(r) z p )l z p so that it fits into the finite summation: 
G(5;z)-Z«*)z* 



G(S;z)- 2ZS(k)z k \/z" + ' 

k=0 
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Therefore the statement is true for p + 1 . ■ 

CLOSED FORM EXPRESSIONS FOR GENERATING FUNCTIONS 

The most basic tool used to express generating functions in closed form is the closed form expression for the geometric series, which is an 
expression of the form a + ar + ar 2 + • • • . It can either be terminated or extended infinitely. 

Finite Geometric Series: a + ar + ar 2 + ■■■+ar"= a[ — - — I (8.5h) 

Infinite Geometric Series: a + ar + ar 2 + ■■■ = — (8.5i) 

l-r 

Restrictions: a and r represent constants and the right sides of the two equations apply under the following conditions: 

(1) r must not equal 1 in the finite case. Note that a + ar + ■■■ar" = (n + l)aif r = 1. 

(2) In the infinite case, the absolute value of r must be less than 1 . 

These restrictions don't come into play with generating functions. We could derive 8.5h by noting that if S (n) = a + ar + ■■■ + ar", n > 0, 
then S(n) = r S(n - V) + a (See Exercise 10 of Section 8.3). An alternative derivation was used in Section 8.4. We will take the same steps 
to derive 8.5 i. Let 

x = a + ar + a r + ••■ 
r x = ar + a r 2 + ■■■ 

x - r x = a 

Therefore, x = — . 

l-r 

Example 8.5.4. 

(a) If S(ri) = 9-5",n > 0, G (5; z) is an infinite geometric series with a = 9 and r = 5 z. Therefore, 

G(S; Z )=^- 

(b) IfT(n) = 4,n > 0, then G(r; z) = 4/(1 - z). 
(c)If U(ri) = 3(-l)",thenG(C/;z) = 3/(1 + z). 

(d) Let C(n) = S(n) + T(n) + U (n) = 9 • 5 + 4 + 3(-l)". Then 



G(C;z) = G(S;z) + C(T;z) + G(U;z) 

9 4 3 

l-5z 1-z l+<- 

14z 2 +34z-16 

5z 3 -z 2 -5z+l 

Given a choice between the last form of G(C;z) and the previous sum of three fractions, we would prefer leaving it as a sum of three 
functions. As we saw in an earlier example, a partial fractions decomposition of a fraction such as the last expression requires some effort to 
produce. 

17 

(e) If G (Q; z) = 34/(2 - 3 z), then Q can be determined by multiplying the numerator and denominator by 1/2 to obtain — — . We recog- 

l--z 

2 

nize this fraction as the sum of the infinite geometric series with a = 17 and r = - z. Therefore Q(n) = 17(3/2)". 

(f) If G(A;z) = (1 + zf , then we expand (1 + zf to / + 3z + 3z 2 + z° . Therefore A(0) = 1, A(l) = 3 A (2) = 3, A(3) = 1, and, 
since there are no higher-powered terms, A (n) = 0, n > 4. A more concise way of describing A is A (k) = C(3; k), since C(n; k) is usually 
interpreted as of k > n. 

Table 8.5.1 contains some closed form expressions for the generating functions of some common sequences. 
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Sequence Generating Function 

S(k) = ba k G(S; z)= — 

l—az 

S(k) = k G(S;z)=- L - 



S(k) = bka k G(S;z)-- (l _ 

S(k) = i GQS; z) = e z 

(C(n;k) 0<k<n 
S(k) = ' G(S;z) = (l+z)" 

\ k > n 

Table 8.5.1 

Closed Form Expressions of some Generating Functions 

Example 8.5.5. Solve S(k) + 3S(k - 1) - 4 S[k - 2) = 0, k > 2, with S(0) = 3 and 5(1) = -2. The solution will be obtained using 
the same steps that were used earlier in this section, with one variation. 

(1) Translate to an equation about generating functions. First, we change the index of the recurrence relation by substituting n + 2 for k. The 
result is S(n + 2) + 3S(n + 1) - 4S(n) = 0, n > 0. Now, if V (n) = S (n + 2) + 35 (n + 1) - 4S(n), then V is the zero sequence, 
which has a zero generating function. Furthermore, V = ST2 + 3(ST)-4S. Therefore, 

= G(V;z) 

= G(5T2; z) + 3G(5t;z) - 4G(5;z) 

G(S;z)-S(0)-S(l)z , , (G(S;z) - S(0)) . 

= - 2 + 4 - 4 G (5; z) 

(2) We want to now solve the following equation for G (5; z): 

G(S;z)-S(0)-S(l)z , . (CHS;z)-S(0)) . 

; 1- 4 - 4 G(S; z) = 

z z 

Multiply by z 2 : 

G(S;z) - 3 + 2z + 3z(G(5;z) - 3) - 4z 2 G(5;z) = 
Expand and collect all terms involving G(5; z) on one side of the equation: 

G(S;z) + 3zG(5;z) - 4z 2 G(5;z) = 3 + 7z 
(1 + 3z - 4z 2 )G(5;z)= 3 + 7z 
Therefore, 

(3) Determine S from its generating function. 

1 + 3z - 4z 2 = (1 + 4z)(l - z) 
thus a partial fraction decomposition of G (5; z) would be: 

A B _ Az-A-ABz-B 

1+4 z 1-z ~ (z-l)(4z+l) 

_ (A+B)+(4B-A); 

<z-l)(4z+l) 

Therefore, A + B = 3and4fi - A = 7 . The solution of this set of equations is A = 1 and B = 2. 
is the generating function of S[(n) = (-4)", and 

2 

— is the generating function of £2(1) = 2(1)" = 2. 
In conclusion, since G (5; z) = G(5i;z) + G(S 2 \ z) , S (n) = 2 + (-4)". 

Example 8.5.6. Let A = {a, b, c, d, e) and let A* be the set of all strings of length zero or more that can be made using each of the 
elements of A zero or more times. By the generalized rule of products, there are 5" such strings that have length n,n>0, Suppose that X n is 
the set of strings of length n with the property that all of the a's and fc's precede all of the c's, rf's, and e's. Thus aaabde e X 6 , but 
abcabc £ X$, Let R(n) = \X„ \. A closed form expression for R can be obtained by recognizing R as the convolution of two sequences. To 
illustrate our point, we will consider the calculation of R (6). 

Note that if a string belongs to X 6 , it starts with k characters from {a, b\ and is followed by 6 - k characters from {c, d, e). Let S (k) be the 
number of strings of a's and b's with length k and let T (k) be the number of strings of c's, </s, and e's with length k. By the generalized rule of 

products, S (k) = 2 k and T (k) = 3 k . Among the strings in Xf, are the ones that start with two a's and b'% and end with c's, d's, and e's. There 
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are 5(2)7(4) such strings. By the law of addition, |X 6 | =R(6) = S(0)T(6) + 5(1)7(5) + ••• + 5(5) 7(1) + 5(6)7(0). Note that the sixth 
term of R is the sixth term of the convolution of 5 with 7, 5*7. Think about the general situation for a while and it should be clear that 
R =5*7. Now, our course of action will be to: 

(a) Determine the generating functions of 5 and 7, 

(b) Multiply G (5; z) and G (7; z) to obtain G (5* 7; z) = G(R; z) (by 10.5e),and 

(c) Determine R on the basis of G (R; z) ■ 

(a) G(5;z) = f 2* z* = -±- , and G(7; z) = E3*z*=-^- 

(b) G(K;z) = G(5;z)G(7;z) 



(1-2 z) (1-3 z) 

(c) To recognize R from G (/?; z), we must do a partial fractions decomposition: 

1 _ A B _ -3 A z+A-2 B z+fl _ (A+fl)+(-3 A -2B) z 



(l-2z)(l-3z) l-2z 1-3 z (2z-l)(3z-l) (2z-l)(3z-l) 

Therefore, A + B = land -3 A - 2B = 0. The solution of this pair of equations is A = - 2 and B = 3. 
Since 

-2 3 

G(R;z) = —^- + - J ~, 

v ' l-2z l-3z 

which is the sum of the generating functions of -2 (2)* and 3 (3)*, 

R(k) = -2 (2) k + 3 (3)* = 3 k+[ - 2 M 
For example, R (6) = 3 7 - 2 7 = 2187 - 128 = 2059. Naturally, this equals the sum that we get from (5*7) (6). To put this number in 

perspective, the total number of strings of length 6 with no restrictions is 5 6 = 15 625, and « 0.131776. Therefore approximately 13% 

of the strings of length 6 satisfy the conditions of the problem. 

EXTRA FOR EXPERTS 

The remainder of this section is intended for readers who have had, or who intend to take, a course in combinatorics. We do not advise that it be 
included in a typical course. The method that was used in Example 8.5.6 is a very powerful one and can be used to solve many problems in 
combinatorics. We close this section with a general description of the problems that can be solved in this way, followed by some examples. 

Consider the situation in which Pi, P 2 , . .., P m are m actions that must be taken, each of which results in a well-defined outcome. For 
each k = 1,2, . . . , m define X k to be the set of possible outcomes of P k . We will assume that each outcome can be quantified in some way 
and that the quantification of the elements of X k is defined by the function Q k : X k -» {0, 1,2, . . .). Thus, each outcome has a non-negative 
integer associated with it. Finally, define a frequency function F k : {0, 1, 2, . . .) -> {0, 1, 2, . . .) such that F k (n) is the number of elements 
of X k that have a quantification of n. 

Now, based on these assumptions, we can define the problems that can be solved. If a process P is defined as a sequence of actions 
Pi, P2, ..., 7 m as above, and if the outcome of P, which would be an element of Xi xX 2 X ■■■ xX m , is quantified by 

m 

Q(a u a 2 , ..., a m )= zZQ k (a k ), 

k=l 

then the frequency function, F, for P is the convolution of the frequency functions for P 1( P 2 , ..., P m , which has a generating function equal to 
the product of the generating functions of the frequency functions F t , F 2 , . . . , F m . That is, 

G(F; z) = G(F,; z) G(F 2 ; z) •••G(F,„; z) (8.5j) 

Example 8.5.7. Suppose that you roll a die two times and add up the numbers on the top face for each roll. Since the faces on the die represent 
the integers 1 through 6, the sum must be between 2 and 12. How many ways can any one of these sums be obtained? Obviously, 2 can be 
obtained only one way , with two 1 's . There are two sequences that yield a sum of 3 : 1-2 and 2- 1 . To obtain all of the frequencies with which the 

numbers 2 through 12 can be obtained, we set up the situation as follows. For j = 1, 2; Pj is the rolling of the die for the f h time. 

Xj = {1, 2, . . . , 6) and Qj : Xj— > {0, 1, 2, 3, ...) is defined by Qj(x) = x. Since each number appears on a die exactly once, the 

frequency function is Fj(k) = 1 if 1 < k < 6, and Fj(k) = otherwise. The process of rolling the die two times is quantified by adding up the 

Qj' s; that is, Q(a\, a 2 ) = Qi(cii) + Q 2 (a 2 ) ■ The generating function for the frequency function of rolling the die two times is then 

G(F;z) = G{Fi;z)G{F 2 ;z) 

= {z 6 +z 5 +z A + z 3 +z 2 +zf 

= (z n +2z li +3z 10 +4z 9 + 5z 8 +6z 7 +5z 6 +4z 5 + 3z 4 + 2z 3 + z 2 ) 

Now , to get F(k) , just read the coefficient of z k ■ For example , the coefficient of z 5 is 4 , so there are four ways to roll a total of 5 . 

To apply this method, the crucial step is to decompose a large process in the proper way so that it fits into the general situation that we've 
described. 

Example 8.5.8. Suppose that an organization is divided into three geographic sections, A, B, and C. Suppose that an executive committee of 11 
members must be selected so that no more than 5 members from any one section are on the committee and that Sections A, B, and C must have 
minimums of 3, 2, and 2 members, respectively, on the committee. Looking only at the number of members from each section on the commit- 
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tee, how many ways can the committee be made up? One example of a valid committee would be 4 A's, 4 B's, and 3 C's. 

Let P A be the action of deciding how many members (not who) from Section A will serve on the committee. X A = {3, 4, 5) and Q A (k) = k. 

The frequency function, F A , is defined by F A (k) =1 if k e X k , with F A (k) = otherwise. G(F A ; z) is then z 3 + z 4 + z 5 ■ Similarly, 

G(F B ; z) = Z 2 + Z 3 + Z 4 + z 5 = G(Fc',z)- Since the committee must have 11 members, our answer will be the coefficient of z" in 
G(F A ; z) G(F B ; z) G(F C ; z), which is 10: 

G(F A ; z) G(F B ; z) G(F C ; z) = (z 3 + z 4 + z 5 ) (z 2 + z 3 + z 4 + z 5 f 

= z 15 + 3z I4 + 6z 13 +9z 12 + 10z 11 +9z 10 + 6z t, + 3z 8 +z 7 

EXERCISES FOR SECTION 8.5 
A Exercises 

1 . What sequences have the following generating functions? 



(a) 


l 


(b) 


10 

2-z 


(c) 


1 + Z 


(d) 


3 , 3 
l+2z l-3z 


2. What sequences have the following generating functions? 


(a) 


1 
1+- 


(b) 


1 
4-3 z 


(c) 


2 i 1 
l-z 1+z 


(d) 


z+2 
z+3 


B Exercises 



3. Find closed form expressions for the generating functions of the following sequences: 

(a) V(n) = 9" 

(b) P, where P (k) - 6P(k - 1) + 5P(k - 2) = for k > 2,with/>(0) = 2andP(l) = 2. 

(c) The Fibonacci sequence: F (k + 2) = F (k + 1) + F (k),k > 0,withF(0) = F(l) = 1. 

4. Find closed form expressions for the generating functions of the following sequences: 

(a) W (n) = C(5; n) 2" for < n < 5 and W (n) = for n > 5. 

(b) Q, where Q (k) + Q(k - 1) - 42 Q (k - 2) = for k > 2, with Q (0) = 2 and 2(1) = 2. 

(c) G, where G(k + 3) = G(k + 2) + G(k + 1) + G(k)ioxk > 0,withG(0) = G(l) = G(2) = 1. 

5. For each of the following expressions, find the partial fraction decomposition and identify the sequence having the expression as a generat- 
ing function. 

(a) 7^7 

(b) 32 ^ 



(c) 



2-3 z+z 2 
6-29 z 



l-llz+30z 2 

6. Find the partial fraction decompositions and identify the sequence having the following expressions: 

(a) -h 

(b) 1+3Z 



(C) 



16-8 z+z 2 

2z 
l-6z-7z 2 

■,th 



7. Given that S (k) = k and T(k) = 10 k, what is the k term of the generating function of each of the following sequences: 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 United States 



Chapter 8 - Recursion and Recurrence Relations 



(a) S + T 

(b) ST * T 

(c) S * T 

(d) ST*ST 

8. Given that P (k) = C(10; k) and Q (k) = &!, what is the k th term of the generating function of each of the following sequences: 

(a) P * P 

(b) P + Pt 

(c) P*<2 

(d) Q*Q 
C Exercises 

9. A game is played by rolling a die five times. For the k th roll, one point is added to your score if you roll a number higher than k. Otherwise, 
your score is zero for that roll. For example, the sequence of rolls 2, 3, 4, 1,2 gives you a total score of three; while a sequence of 1,2,3,4,5 

gives you a score of zero. Of the 6 5 = 7776 possible sequences of rolls, how many give you a score of zero?, of one? ... of five? 

10. Suppose that you roll a die ten times in a row and record the square of each number that you roll. How many ways could the sum of the 
squares of your rolls equal 40? What is the most common outcome? 
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I 8.6 Recursion and Computer Algebra Systems 



There is frequently debate as to if, how, and when computers should be introduced teaching a topic such as recurrence relations. We have 
chosen to append this information at the end of the chapter. If desired, instructors can intersperse the following subsections with the previous 
sections, cover this section a the end, or simply ignore this section. Motivated students are welcome to read it at any time, but should be 
warned that many instructors will still prefer that they do some of the calculations "by hand" within their courses. 

Note: At the present time, this section only describes the topics in Mathematica. If any readers with a knowledge of Sage or any other com- 
puter algebra system would like to contribute to this section, contact us through the addresses that appear at http://faculty.uml.edu/kl- 
evasseur/ads2/. 

Recursive Definitions (8.1 and 8.2) 
Mathematica 

Functions can be defined recursively in Mathematica. For example, the Fibonacci sequence, 1, 1, 2, 3, 5, 8, 13, . 



(each number is the sum of 



the previous two) can be defined with the following input . 

fib[0] = 1; 
fib[l] = 1; 
fib[i_ /; i > 1] := fib[i - 2] + fib[i 



1] 



The left side of the third line, fib [ i_/ ; i>l ] indicates that fib is being defined in this case for a "pattern" called i satisfying the condition 
that i is greater than 1 . Any "input" that fits this pattern will have this rule applied to it. Notice that the definitions for f ib [ ] and f ib [ 1 ] 
coexist with this general rule and define fib for two numbers that don't fit the pattern described in the general rule. 

With the input above evaluated, we can determine the 12th Fibonacci number. 
fib[12] 

233 

This particular definition is not efficient because previously computed values of fib are not saved. As a result, the time it takes to compute 
f ib [ i ] is roughly proportional to the value of f ib [ i ] , which grows exponentially. This can be demonstrated with the following bit of code. 

fp = ListPlot [ ({ttl, fib[Bl] } &) /©Range [3, 21, 3] , 

Joined -» True, PlotLabel -» "Plot of fib[i] "] ; 
tfp = ListPlot [ ({ttl, First [Timing [fib [Bl] ] ] } &) /©Range [3, 21, 3] , 

Joined -» True, PlotLabel -» "Timing Plot for computing fib[i]"]; 
Show[GraphicsRow[{fp, tfp}, ImageSize ~> 1 .5 {288, 84.75"}]] 





Plot of fib[i] 


Tii 


ning Plot for computing fib[i] 


5000 




0.04 


: / 






0.03 


- 1 


0000 






: / 






0.02 


- / 


5000 




0.01 


} . _/ . 



Taking over three seconds to compute the 30 th Fibonacci number is unacceptable. 
{Timing [fib [30] ] , $System} 

({3.32795, 1 346269}, Mac OS X x86 (64-bit)) 

The timing result above depends on the system you use, so you might want to execute this expression for comparison. 

How to get around the timing problem. You could never get a value like fib [ 100 ] using the recursive definition because of the obvious 
time growth. To get around the problem, you can define a variation of the function that saves it's results. 

The following definition allows for saving the results that are computed. Notice that the SetDelay ( : =) involves a Set (=). 

fib2[0] = 1; 
fib2[l] = 1; 
fib2[i_ /; i > 1] := fib2[i] = fib2[i - 2] + fib2[i - 1] 

Now let's compute the 30th number in the sequence with this new definition 
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Timing[fib2 [30] ] 

{0.000317, 1346269) 
Now that the value of fib [ 30 ] has been computed, it's value is returned almost instantly 

Timing[fib2 [30] ] 

(0.000012, 1346269) 
Now you can see what f ib2 [100] equals in a reasonable time. 

Timing [fib2 [100] ] 

(0.000499,573147 844013 817084101) 

There is still a problem with using recursion. If we were to ask for the 500 th Fibonacci number at this point, Mathematica would have difficulty 
because it still had 400 levels of recursion to negotiate. For this reason, getting to the 500 th Fibonacci number is best reached by evaluating 
every 50 th or so of the numbers instead of directly asking for the 500 th 

Do[fib2[50k] , {k, 3, 10}]; 

fib2[500] 

225 591516161936330872512695036072072046011324913 758190588 638 866418474627738 686883405 015 987052796'-. 
968498626 

Solution of Recurrence Relations (8.3 and 8.4) 
Mathematica 

Mathematica has a function called RSolve that will solve some recurrence relations, including linear recurrence relations with constant 
coefficients. In addition, it can handle some systems of recurrence relations, which we will discuss in a Chapter 12. 

? RSolve 

RSolve[e^«, a[n], n] solves a recurrence equation for a[n]. 

RSolve[(e^n,, eqn^, ...), [ai[n], a 2 [n], ...), n] solves a system of recurrence equations. 

RSolve[e^?, of.ni, »2, ■••], (»i, »2, •••}] solves a partial recurrence equation. » 

Example. Consider the sequence 5 defined as follows . 

S(k)= l.3S(k-l) -0.7 with 5(0)= 3 

We can get the general solution to the recursive part of the definition, if desired. 

recurence = S[k] == 1.3 S [k - 1] - 0.7; 
RSolve [ recur ence, S[k], k] 

{{SQc) -> 1.3*" 1 ' (2.33333 X2.71828 - 262364 - 0262364 * - 3.03333) + c, 1.3* -1 }} 
Or we can add the initial condition to the first argument of RSolve and get a unique solution 
solution = RSolve [{recurence, S[0] == 3}, S[k], k] 



{{S(k)^> 0.769231 x 2.71828"° 26236 " (3.03333 x 1.3* +0.866667 x 1.3*2.71828' 



0.262364 k\ 



Notice that since approximate numbers 1.3 and 0.7 are involved in the definition of 5, the solution involves approximate numbers. If we had 
used — and —.respectively, the solution would be exact. 
Next we assemble a list of ordered pairs based on the solution. 
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points = Table [{k, S[k] } /. First [solution] , {k, 0, 8}] 



'0 


3. 


1 


3.2 


2 


3.46 



3 3.798 

4 4.2374 

5 4.80862 

6 5.55121 

7 6.51657 
U 7.77154 ) 

Here is a plot of the first few terms of the solution. 

ListPlot [points, PlotStyle -» {Red, PointSize [0.03] } , AxesOrigin -» {0, 0}] 



2 4 6 8 

Example. Here is the solution to a second order linear recurrence relations with constant coefficients. Initial conditions included. Notice that 
the exact data in the system produces exact expressions. 

RSolve[{T[k] -6T[k-l]+3T[k-2] == 2"k, T[0] ==2,T[1] ==3},T[k],k] 



{{T{k)^-(5x3 1 



-19 



V2~3* + ^f3 - V^ 



■ 28 x 3* 



6 +19V2 3*3 + 



^6 



28 x 3 k+[ (3 + yfe'j + 3 x 2 M {{3 - 77) (3 + V^))l)/( 4 ( 2 V^" " ^ ( 3 + 2 A/^~) 2 



Example. Here is the solution to a recurrence relation similar to the ones described in Section 8.4. The solution is in terms of the Gamma 
function, which is a generalization of the factorial function. 

Usol = U[k] /. First [RSolve[{u[k] -kU[k- 1] == 2" k , U[0] == l}, U[k] , k] ] 



V^r 



k+l, 



Here is a table of the first few terms of the solution. 
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Table[{j, N[Usol /. {k- j}]}, {j, 0, 10}] 



f 


1. 


1 


1.5 


2 


3.25 


3 


9.875 


4 


39.5625 


5 


197.844 


6 


1187.08 


7 


8309.55 


8 


66476.4 


9 


598288. 



10 5.98288 xlO 6 _ 

Example. Here is the solution to the recurrence for the number of derangements of a set of n elements, D(ri). We use D instead of D here 
because Mathematical reserves the name D for the derivative function. 



derangementSol = RSolve[ {£>[n] == (n - 1) ( 2>[n - 1] + 2>[n - 2] ) , ©[1] == 0, Z>[2] 

r(2, -i)r(n + i)-r(n+ 1, -i). 



1}, 2)[n], n] 



{{£>(«) 



2T(2, -l)-r(3, -1) 
Table[{n, S[n]} /. First [derangementSol] /. {n-»k}, {k, 1, 10}] //Round 



< 1 


1 


2 


1 


3 


2 


4 


9 


5 


44 


6 


265 


7 


1854 


8 


14833 


9 


133496 


v 10 


1334961 ; 



Generating Functions (8.5) 

The examples in Section 8.5 solving recurrence relations with generating functions were selected to keep the algebra reasonably neat. Here we 
will step through the solution of a second order linear recurrence relation with constant coefficients for which the numbers are not so nice. 
Using Mathematica, the messy work isn't a problem. 

Mathematica 

Consider the recurrence relation S(k) = -S(k - 1) + 5 S(k - 2), with S (0) = 4 and S (1) = 3. In order to check our computations, lets define S 
recursively and get its first few terms: 

S[0] = 4; S[l] = 3; 

S[k_] :=S[k] =-S[k-l]+5S[k-2] 

Map[S, Range [0, 10]] 

(4, 3, 17, -2, 87, -97, 532, -1017, 3677, -8762, 27 147) 

We can find the generating function for S by observing that S = -S 1 +5 S T 2 . Using the formulas for the generating functions of S T and 
ST2 that were derived in Section 8.5, we get the following equation which we can solve for G = G(S; z): 

Clear [G] 

(G-S[0] -S[l] z) (G-S[0]) 

gf Equation = + 5 G == 

z 2 z 

G-3z-4 G-A 
+ 5G = 

z 2 z 

We solve for G and since there is only one solution, we use First to extract that single solution from the output of Solve. 
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Gsol = G /. First [Solve [gf Equation, G] ] 

-7z-4 



At this stage, we can extract a finite number of coefficients from the generating function. We do so here, but it only verifies the list of terms we 
computed above. We still don't have a closed form expression for S. 

seriesExpansion = Series[Gsol, {z, 0, 10}] 

4 + 3z+ 17z 2 -2z 3 +87z 4 -97z 5 + 532z 6 - 1017 z 7 + 3677 z 8 - 8762 z 9 + 27 147 z 10 + 0(z n ) 
Coef f icientList [ seriesExpansion, z] 

(4, 3, 17, -2, 87, -97, 532, -1017, 3677, -8762, 27 147) 

In order to get a close form expression we need to get partial fractions decomposition of the generating function. The function Apart is 
meant to do this, but the next result isn't very encouraging 

Apart [Gsol] 

-7z-4 



5 z 2 - z - 1 
The difficulty is that Apart uses the Factor function that only factors polynomials over into factors with rational coefficients. 

Factor [Gsol] 

7z + 4 



5 z 2 - z - 1 

In order for factor to work on this denominator, we need to tell Mathematical what square root should be included. The discriminant of the 
denominator is 

Discriminant [5 z 2 - z - 1 , z] 

21 

Therefore we need to extend the rational numbers to include V 21 . We will discuss extensions in a more formal setting in Chapter 16. 
Factor[Gsol, Extension -> |V2l}] 

20(7z + 4) 



(-10z + V2T+ l)(l0z + V2T- l) 
With this factored expression, Apart will do its job. 

PF = Apart [Factor [Gsol, Extension-* {Sqrt[21]}]] 



47-7V21 47 + 7V21 



V2T(l0z + V2T- l) V2f(-10z + V2T + l) 

We will work with the two terms above, both of which is the generating function of a geometric sequence, individually to get an expression that 

look like b\ a\ k + bi a^ ■ The problem is to find the bj ' s, which are initial terms of the generating functions and the a, ' s with are the ratios of 
successive coefficients of the generating functions. 

GS1 = PF[ [1] ] 

47 + 7V2T 



V2T(-10z + V2T+ l) 

Why isn't the first term that appears in the output for PF the same as the first part that we called GS1? It has to do with output forms in 
Mathematica and let's not get into that right now. This all works out, believe me! 

A quick word on the output from Series: If the series expansion up to degree n for a function /(z) centered at is a$ + a.\ z+ ■■■ + a n z", then 
the internal structure of the output from Series is a SeriesData expression. The variable and center of the series expansion are the first 
two arguments of this expression, and the third is the list of coefficients, which is what we will be extracting here. 
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InputForm [series [e 2z , {z, 0, 4}]] 

SeriesData[z, 0, {1, 2, 2, 4/3, 2/3}, 0, 5, 1] 

The last three argument of the output indicate the powers of the variable to attach to the variable, through 5 in steps of 1 in the example 
above. The first part of the third argument is the first term of the first geometric sequence. 

bi = Series[GSl, {z, 0, 5}] [ [3, 1] ] 



147 + 47 V 21 

21 (l + \[2\\ 

To extract the quotient of the second with the first arguments we use the pure function (tt[[3, 2]]/tt[[3, 1]]) &. This is the common ration of 
the first geometric sequence. 

a! = Series [GS1, {z, 0, 5}] // (tt[[3, 2]] /B[[3, 1]]) & 
10 



1 + ^f2^ 
We repeat the same calculations for the second geometric sequence. 

GS2 = PF[ [2] ] 

47-7 vir 



V21~(l0z + vTT-i) 

b 2 = Series[GS2, {z, 0, 5}] [ [3, 1] ] 

47 ^[2\ - 147 



2l(V"2T- l) 

a 2 = Series [GS2, {z, 0, 5}] // (tt[[3, 2]] /B[[3, 1]]) & 
10 



'21 -1 
We collect the four constants we've just computed into the sum of two geometric sequences. 

Ssol [k_] : = bi ai k + b 2 a 2 k 

Next we test our expression to see if it matches the two other lists of terms if S that we've computed. The result may be disturbing, but the 
reason for the difference is that Mathematica hold off simplifying some expressions until requested. This is the case here. 
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Map[Ssol, Range [10]] 



101147 + 47 V21 I 

21 (l + VIT) 2 

1000 (l47 + 47 V2T 
21 fl + V2T) 4 



10147 V21 -147 100 147 V 21 -147 1001147 +47 V21 

, 1- 



21 V21 -1 



21 V21 -1 



21 1 + V21 



1000 47 V 21 -147 10000 47 V21 -147 10000 147 +47 V 21 



21 V21 -1 



21 V21 - 1 



21 1 + V21 



100000 147+ 47 V 21 100000 47 V 21 -147 1000000 47 V21 -147 1000000 147 +47 V21 



21 fl + V2T) 



21 V21 -1 



21 V21 -1 



21 1 + V21 



10000000 147+ 47 V 21 10000000 47 V21 -147 100000000 47 V 21 -147 1000000001147 +47 V 21 

- , h 



21 fl + V2T) 



2l(V2T-l) 



2l(V2T-l) 



21 fl + V2T) 



1000000000 147 + 47 V 21 1000000000 47 V 21 -147 



2l(l + V"2T) 2l(V2T-l) 

10 000 000 000 (47 V2T - 147) 10 000 000 000 (l47+47 V2T 



21 1 + V21 



47 V 21 - 147 1 (-10)* a/ 21 -l) + — 1 147+ 47 V 21 |10*|1 + V 21 



21 (V2T - l) 
The simplified result should look familiar. 

Map[Ssol, Range [0, 10]] // Simplify 

(4, 3, 17, -2, 87, -97, 532, -1017, 3677, -8762, 27 147) 
Finally, the closed form expression for our sequence can be examined 
Ssol[k] 

— Ul y[2\ - \Al\(-\Qi) k iy[2\ - l] + — 

Exercises for Section 8.6 
A Exercises 

1 . If S(0) = 1000, and B(n) = 1 .05 1/n B{n - 1) + 1000 for n > 1 , compute S(20) and plot the values of B for < n < 20. 

2. Compute the a closed form expression for F(n), where F(0) = 1,F(1) = 2,F(2) = 2.5, and if k > 3, 

F(k) = 0.9 F{k - 1) + 0.52 F(k - 2) - 0.42 F(k - 3). 
As k gets large, what does F(k) tend toward? 

3. Use generating functions to find a closed form expression for 

W{Q) = \, W(l) = l 

W(n) = 3 W{n - 1) + W(n - 2) when n > 2 
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Section 8.1 

1 . Write out a recurrence relation to describe the number of n digit positive integers (in decimal form) that contain no repeating digits . 

2. The number of k subset partitions of an ^-element set, S(n, k), satisfies the recurrence relation S(n, k) = k S(n - 1 , k) + S(n - 1, k — 1) for 
n > 3 and 2 < k < n. The numbers S (n, k) are called Stirling Numbers of the Second Kind. 

(a) What are S(n, 1) and 5 (n, n)fovn> 1? 

(b) Compute S (5, 2), 5(5, 3) , and S (5, 4) . 

(c) How many partitions are there of a five-element set? 

n 

(d) What is the significance of zZ S(n, k)l 

3 . Consider the following algorithm, called Split: 

Input: a list, Lin = (a\, a-i, . . . , a„) of n numbers, where n is a natural number 
Output: a list, Lout = (b\, bi . . . , b n ) of n numbers If n < 2, then Lout := Lin 
else \n > 2) 

1. Let L 1 := {a\, 03, a 5 ...) and L2 := («2> 04, 06. ■■■) 

2. Execute Split with input L 1 and output L 1 out 

3 . Execute Split with input L2 and output L 2 out 

4. Lout := (L 1 out, L2 out), that is, the list obtained by copying L 1 out and then copying L 2 out 

(a) What is the output from Split if the input is (1, 2, 3, 4)? 

(b) What is the output from Split if the input is (0, 1, 2, ... , n — 1), where n is equal to 2r for some natural number r? Hint: the 
general answer is related to the binary representations of the numbers in the list. 

Section 8.2 

4. Prove by induction that S(k) = (2 + k) 10* is a solution of the recurrence relation S(k) = 20 S(k - 1) - 100 S(k - 2); 5(0) = 2, 5( 1) = 30. 

5 . Explain why B(n) = 1 + 2 + • • • + 2" is not a closed form expression. 

6. Between them, Abe and Zeke have n coins. They each flip one coin and if they match (both heads or both tails), then Abe keeps both coins. 
Otherwise, Zeke keeps both coins. They continue as long as both have at least a coin. Let L (njc) be the expected number of flips that take place 
in the game if Abe starts with k coins. If < k < n, then 

L(n, k)=\+ (L(n, k - 1) + L(n, k + l))/2 

(a) Justify this equation. 

(b) What are L(n, 0) and L(n, n)l 

(c) Tabulate L(n, k)forn = 2, 3, and 4. 

Section 8.3 

7. On her nth birthday, Kathryn receives n dollars from her Uncle Dave and deposits it into a special account that pays interest of 10 % each 
year. On her 21st birthday, how much does she have in the account? 

8. Suppose you borrowed $4,000 at 8% interest and you made payments of $250 per month. Let D(n) be your debt n months after taking out 
the loan. Then D(0) = 4000 and D(n) = 1 .08 D(n - 1) - 250 for n > 0. 

(a) Derive a closed form expression for D (n). 

(b) Estimate how long it would take you to pay off the loan. Use logs if necessary. A rough estimate is sufficient. 

9. (Tower of Hanoi Problem): A classic problem that can be solved using material of this section is the Tower of Hanoi puzzle. Assume that we 
have a board with three pegs mounted on it and on one of the pegs n circular disks of decreasing size (smallest on top). The problem is to 
determine how many moves it takes to move the n disks from one peg to another without placing a larger disk on a smaller one at any time. 

(a) If X(n) stands for the number of moves it takes to move n disks from one peg to another, then X(n) = 2 X(n - 1) + 1 . Justify this recurrence 
relation. 

(b) Solve the recurrence relation in part a. Verify your result, and the formula in part a, using three disks. 

Section 8.4 
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10. Suppose that T(0) = and T(o) = 1 + 7([a/2J) for a > 0. If a = (1 101001011011 110) two , what is T(d)l 

1 1 . Find a solution for n = 2 k : Q(n) = n + Q(\n = 4J), n > 1, Q (0) = 0. 

12. The recurrence relation x(n + 1) = 1 + x(n) 2 with x(0) = 1 has a solution that is approximately equal to c' <n) , where c is a constant and 

t(ri) = 2", Estimate the value of c based on values of jc(0), jc(1) , ... , x(4). Test your answer by computing x(5) and comparing it to c' <5) . The 
significance of x (n) is that it equals the number of binary trees of depth n or less (see Chapter 10) . 

Section 8.5 

13. Write out the first five terms of the generating function of sequence S where 5(0) = 1 and for k > 0, S(k) = 5(0) + • • • + S(k - 1). 

14. (a) Determine the closed form expression for the terms of sequence 5, 

where S(k) - 6S(k - 1) = for £ > 1 and 5(0) = 7. 

(b) What is the generating function, G(5; z), for sequence 5? Write out the first four terms of your answer and then write out the closed form 
expression for the generating function. 

15. Suppose that 5 is a sequence with generating function G (5; z) and that T has generating function G(5; cz). How is T related to 5? 
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chapter 9 




GRAPH THEORY 



GOALS 

This chapter has three principal goals. First, we will identify the basic components of a graph and some of the optional features that many 
graphs have. Second, we will discuss some of the questions that are most commonly asked of graphs. Third, we want to make the reader aware 
of how graphs are used to model different situations. In Section 9.1, we will discuss these topics in general, and in later sections we will take a 
closer look at selected topics in the theory of graphs. 

Chapter 10 will continue our discussion with an examination of trees, a special type of graph. 

I 9.1 Graphs— A General Introduction 

Recall that we introduced directed graphs in Chapter 6. 

Definition: Directed Graph. A directed graph consists of a set of vertices, V, and a set of edges, E, connecting certain elements of V. 
Each element of E is an ordered pair from V (i.e., an element ofVxV). The first entry is the initial vertex of the edge and the second entry is 
the terminal vertex. In certain cases there will be more than one edge between two vertices, in which cases the different edges are identified 
with labels. 

Despite the set terminology in this definition, we usually think of a graph as a picture, an aid in visualizing a situation. In Chapter 6, we 
introduced this concept to help understand relations on sets. Although those relations were principally of a mathematical nature, it remains true 
that when we see a graph, it tells us how the elements of a set are related to one another. 

Definition: Simple Graph and Multigraph. A simple graph is one for which there is no more than one vertex directed from any vertex to 
another vertex. All other graphs, ones with at least two edges from one vertex to some other vertex, are called multigraphs. 

To illustrate the points that we will make in this chapter, we will introduce the following examples of graphs. 

Example 9.1.1. A Directed Graph. Figure 9.1.1 is an example of a simple directed graph. In set terms, this graph is (V, E), where 
V = {s, a, b) and E = {(s, a), (s, b), (a, b), [b, a), (b, b)). Note how each edge is labeled either or 1. There are often reasons for 
labeling even simple graphs. Some labels are to help make a graph easier to discuss; others are more significant. We will discuss the signifi- 
cance of the labels on this graph later. 
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Figure 9.1.1 

A directed graph 

Example 9.1.2. An Undirected Graph. A network of computers can be described easily using a graph. Figure 9.1.2 describes a 
network of five computers, a, b, c, d, and e. An edge between any two vertices indicates that direct two-way communication is possible 
between the two computers. Note that the edges of this graph are not directed. This is due to the fact that the relation that is being displayed is 
symmetric (i.e., if X can communicate with Y, then Kcan communicate with X). Although directed edges could be used here, it would simply 
clutter the graph. 




Figure 9.1.2 

An undirected graph 

There are several other situations for which this graph can serve as a model. One of them is to interpret the vertices as cities and the edges as 
roads, an abstraction of a map such as the one in Figure 9.1.3. Another interpretation is as an abstraction of the floor plan of a house (Figure 
9.1 .4). Vertex a represents the outside of the house; all others represent rooms. Two vertices are connected if there is a door between them. 




■jStJsSsfiSsfffSSSJ-JStJfsSSjjJys s///^ 



Figure 9.1.3 

Road Map 



Definition: Undirected Graph. An undirected graph consists of a set V, called a vertex set, and a set E of two-element subsets of V, 
called the edge set. The two-element subsets are drawn as lines connecting the vertices. 

The undirected graph of Figure 9.1.2 is V = {a, b, c, d, e) and E = {(a, b,}, {a, d], {b, c), {b, d\, {c, e\, {b, e}}. 
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Definition: Complete Undirected Graph. A complete undirected graph of n vertices is an undirected graph with the property that each 
pair of distinct vertices are connected to one another. Such a graph is usually denoted by K n . 

Example 9.1.3. A Multigraph. A common occurrence of a multigraph is a road map. The cities and towns on the map can be thought of 
as vertices , while the roads are the edges . It is not uncommon to have more than one road connecting two cities . In order to give clear travel 
directions, we name or number roads so that there is no ambiguity. We use the same method to describe the edges of the multigraph in Figure 
9.1 .5. There is no question what e3 is; however, referring to the edge (2, 3) would be ambiguous. 




Figure 9.1.5 

A multigraph 

Example 9.1.4. A flowchart is a common example of a simple graph that requires labels for its vertices and some of its edges. Figure 
9.1 .6 is one such example that illustrates how many problems are solved. 



Q: Is the 
Problem Solved 




Figure 9.1.5 

Flowchart for the problem-solving process 

At the start of the problem-solving process, we are at the vertex labeled "Start" and at the end (if we are lucky enough to have solved the 
problem) we will be at the vertex labeled "End." The sequence of vertices that we pass through as we move from "Start" to "End" is called a 
path. The "Start" vertex is called the initial vertex of the path, while the "End" is called the final, or terminal, vertex. Suppose that the problem 
is solved after two attempts; then the path that was taken is Start, R, A, Q, L, A, Q, End. An alternate path description would be to list the 
edges that were used: 1, 2, 3, No, 4, 3, Yes . This second method of describing a path has the ad vantage of being applicable for multigraphs . 
On the graph in Figure 9 . 1 .5 , the vertex list 1, 2, 3, 4, 3 does not clearly describe a path between 1 and 3 , but e i , en , e^ , e-j is unambiguous . 

A SUMMARY OF PATH NOTATION AND TERMINOLOGY 

If x and y are two vertices of a graph, then a path between x and y describes a motion from x and y along edges of the graph. Vertex x is called 
the initial vertex of the path and y is called the terminal vertex. A path between x and y can always be described by its edge list, the list of edges 
that were used: (ei, ei, ..., e„), where : (1) the initial vertex of e\ is x; (2) the terminal vertex of e, is the initial vertex of e i+ \ , 
i = 1, 2, .... n — 1; and (3) the terminal vertex of e„ is y. The number of edges in the edge list is the path length. A path on a simple graph 
can also be described by a vertex list. A path of length n will have a list of n + 1 vertices v = x, Vi, V2, ...,v„=y, where, for 
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k = 0, 1,2, . . . , n — 1, (V£, v t+1 ) is an edge on the graph. A circuit is a path that terminates at its initial vertex. 

Suppose that a path between two vertices has an edge list (e„ e 2 , . . . ,e„). A subpath of this graph is any portion of the path described by one 
or more consecutive edges in the edge list. For example, (3, No, 4) is a subpath of (1, 2, 3, No, 4, 3, Yes). Any path is its own subpath; 
however, we call it an improper subpath of itself. All other subpaths are called proper subpaths. 

A path or circuit is simple if it contains no proper subpath that is a circuit. This is the same as saying that a path or circuit is simple if it does not 
visit any vertex more than once except for the common initial and terminal vertex in the circuit. In the problem-solving method described in 
Figure 9.1 .6, the path that you take is simple only if you reach a solution on the first try. 

Example 9.1.5. The leadership structure of a corporation is often represented with a graph as in Figure 9.1 .7. 



President 




Executive VP 



Administrative VP 



Financial VP 



Production 
Supervisor 




Office 
Manager 



Treasurer 



Figure 9.1.7 

Organization of a corporation 

The principle behind such a structure is that everyone but the president has a single immediate supervisor. Any action that anyone takes can 
reach the president only through a unique "chain of command." This chain-of-command property is characteristic of a special type of graph 
called a tree. Note that the edges of this graph are not directed, but, as in a Hasse diagram, the relation between two connecting vertices is clear: 
the top vertex is the supervisor of the lower vertex. 

The process of structured (or top-down) problem solving results in a graph that is similar to this tree. Starting with the top of the tree, which 
would represent the whole problem, the problem is divided into a sequence of separate subproblems. Each subproblem is divided further into 
smaller sub-problems in the same way until the solutions of the lowest problems are easy enough to recognize. 

From these examples, we can see that although a graph can be defined, in short, as a collection of vertices and edges, an integral part of most 
graphs is the labeling of the vertices and edges that allows us to interpret the graph as a model for some situation. 

Example 9.1.6. A Graph as a Model for a Set of Strings. Suppose that you would like to mechanically describe the set of strings of 0's 
and l's having no consecutive l's, One way to visualize a string of this kind is with the graph in Figure 9.1.1. Consider any path starting at 
vertex s. If the label on each graph is considered to be the output to a printer, then the output will have no consecutive l's. For example, the 
path that is described by the vertex list (s, a, b, b, a, b, b, a, b) would result in an output of 10010010. Conversely, any string with no 
consecutive l's determines a path starting at s. 

Example 9.1.7. A Tournament Graph. Suppose that four teams compete in a round-robin sporting event; that is, each team meets every 
other team once, and each game is played until a winner is determined. If the teams are named A, B, C, and D, we can define the relation yS on 
the set of teams by X ft Y if X beat Y. For one set of results , the graph of /3 might look like Figure 9.1.8. 
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Figure 9.1.8 

Round-robin graph with four vertices 



Definition: Tournament Graph. 



(a) A tournament graph is a directed graph with the property that no edge connects a vertex to itself, and between any two vertices there is at 
most one edge. 

(b) A complete (or round-robin) tournament graph is a tournament graph with the property that between any two distinct vertices there is 
exactly one edge. 

(c) A single-elimination tournament graph is a tournament graph with the properties that: (i) one vertex (the champion) has no edge 
terminating at it and at least one edge initiating from it; (ii) every other vertex is the terminal vertex of exactly one edge; and (Hi) there is a 
path from the champion vertex to every other vertex. 

Example 9.1.8. The major league baseball championship is decided with a single-elimination tournament, where each "game" is actually 
a series of games. Until 1995, the two divisional champions in the American League (East and West) compete in a series of games. The loser 
is eliminated and the winner competes against the winner of the National League series (which is decided as in the American League) . The 
tournament graph of the 1983 championship is in Figure 9.1.9. 



Baltimore 
AL East 



Chicago 
AL West 



Philadelphia 
NL East 



Los Angeles 
NL West 



Figure 9.1.9 

1983 Major League Baseball Championship 

The question "Once you have a graph, what do you do with it?" might come to mind. The following list of common questions and comments 
about graphs is a partial list that will give you an overview of the remainder of the chapter. 
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Question 1. How can a graph be represented as a data structure for use on a computer? We will discuss some common Pascal data structures 
that are used to represent graphs in Section 9.2. 

Question 2. Given two vertices in a graph, does there exist a path between them? The existence of a path between any or all pairs of vertices in 
a graph will be discussed in Section 9.3. A related question is: How many paths of a certain type or length are there between two vertices? 

Question 3. Is there a path (or circuit) that passes through every vertex (or uses every edge) exactly once? Paths of this kind are called traver- 
sal. We will discuss traversals in Section 9.4. 

Question 4, Suppose that a cost is associated with the use of each vertex and/or edge in a path. What is the "cheapest" path, circuit, or traversal 
of a given kind? Problems of this kind will be discussed in Section 9.5. 

Question 5. Given the specifications of a graph, or the graph itself, what is the best way to draw the graph? The desire for neatness makes this a 
reasonable question. Another goal might be to avoid having edges of the graph cross one another. This is discussed in Section 9.6. 

ISOMORPHIC GRAPHS 

We will close this section by establishing the relation "is isomorphic to," a form of equality on graphs. The graphs in Figure 9.1.10 obviously 
share some similarities, such as the number of vertices and the number of edges. It happens that they are even more similar than just that. If the 
letters a, b, c, and d in a are replaced with the numbers 1,3,4, and 2, respectively, and they are moved around so that they appear as in b, you 
obtain b. 





Figure 9.1.10 

Two Isomorphic Graphs 

Here is a more precise definition that reflects the fact that the actual positioning of vertices isn't an essential part of a graph. 

Definition: Isomorphic Graphs. Two graphs (V, E) and (V, E') are isomorphic if there exists a bijection f:V->V such that 
(\'j, Vjj e E if and only if(f(Vi), /(v/J) £ E'. For multigraphs, we add that the number of edges connecting v, to vj, must equal the number of 
edges from /(v,) to f(vj). 

Degrees and Graphic Sequences 

The most significant local characteristic of a vertix within a graph is its degree. Collectively, the degrees can partially characterize a graph. 
Definition: Degree. 

(a) Let v be a vertex of an undirected graph. The degree ofv, denoted deg(v), is the number of edges that connect v to the other vertices in 
the graph. 

(b) If v is a vertex of a directed graph, then the outdegree ofv, denoted outdeg(v), is the number of edges of the graph that initiate at v. The 
indegree ofv, denoted indeg(v). is the number of edges that terminate at v. 

Example 9.4.9. 

(a) The degrees of vertices 1 through 5 in Figure 9.1.11 are 2,3,4, 1 , and 2, respectively. 
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Figure 9. 1.11 

(b) In a tournament graph, outdeg (v) is the number of wins for v and indeg (v) is the number of losses. In a complete (round-robin) tour- 
nament graph with n vertices, outdeg (v) + indeg (v) = n - 1 for each vertex. 

Graphic Sequences 

A finite nonincreasing sequence of integers d\ , d.2 , . . . , d„ is a graphic if there exists a simple graph with n vertices having the sequence as its 
degree sequence. For example, 4,2, 1, 1, 1, lis graphic because the degrees of the following graph match these numbers. 




4" 5' 

Note: There is no connection between the vertex name/number and its degree. 



EXERCISES FOR SECTION 9.1 
A Exercises 

1. What is the significance of the fact that there is a path connecting vertex b with every other vertex in Figure 9.1.2, as it applies to various 
situations that it models? 

2. Draw a graph similar to Figure 9.1.1 that represents the set of strings of 0's and l's containing no more than two consecutive l's. 

3. Draw a directed graph that models the set of strings of 0's and l's where all of the l's must appear consecutively . 

4. In the NCAA final-four basketball tournament, the East champion plays the West champion, and the champions from the Mideast and 
Midwest play. The winners of the two games play for the national championship. Draw the eight different single-elimination tournament graphs 
that could occur. 

5 . What is the maximum number of edges in a simple undirected graph with eight vertices? 

6. Which of the graphs in Figure 9.1.11 are isomorphic? What is the correspondence between their vertices? 
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(a) 



(b) 



(c) 
4 



1 -2-3-4-5 



(d) 
2\ \3 




(f) 



(h) 



(g) 

1 



(i) 

3 



[4j 5 

Figure 9.2.11 

7. (a) How many edges does a complete tournament graph with n vertices have? 

(b) How many edges does a single-elimination tournament graph with n vertices have? 

8. Draw complete undirected graphs with 1, 2, 3, 4, and 5 vertices. How many edges does a^„a complete undirected graph with n vertices, 
have? 

9 . Determine whether the following sequences are graphic . Explain your logic . 

(a) 6,5,4,3,2,1,0 

(b) 2,2,2,2,2,2,2 
(c)3,2, 2, 2, 2, 2 
(c)5, 3,3, 3,3, 3 

(e) 1, 1, 1, 1, 1, 1 

(f) 5,5,4, 3,2, 1 

10. (a) Based on observations you might have made in exercise 9, describe as many characteristics as you can about graphic sequences of 
length n . 

(b) Consider the two graphs 






and 




Notice that they have the same degree sequences, 2, 2, 2, 2, 2, 2. Explain why the two graphs are not isomorphic. 
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I 9.2 Data Structures and Computer Generation of Graphs 

In this section, we will describe data structures that are commonly used to represent graphs. In addition we will introduce the basic syntax for 
graphs in Mathematica and Sage. 

Assume that we have a graph with n vertices that can be indexed by the integers 1,2, . . . , n. 

Data Structure 1: Adjacency Matrix. As we saw in Chapter 6, the Information about edges in a graph can be summarized with an adjacency 
matrix, G, where Gy = 1 if and only if vertex i is connected to vertex j in the graph. Note that this is the same as the adjacency matrix for a 
relation, with the exception. 

Data Structure 2: Edge List I. Note that the initializing procedure for an adjacency matrix presumes that a list of edges for the graph exists. 
This second data structure maintains this list form. For each vertex in our graph, there will be a list of edges that initiate at that vertex. If G 
represents the graph's edge information, then G, would be the list of edges initiating at vertex i. 

Data Structure 3: Edge List II. An even simpler way to represent the edges is to maintain a list of ordered pairs. 





Figure 9.2.1 



Example 9.2.1. Given the graph in Figure 9.2.1 , the adjacency matrix that represents the graph would be 



C, 



(0 1 1 -\ 

11 
10 

U o o o j 

The same graph would be represented with the edge list of type 1 : 

G,={{2,4),{3,4), (3), {1}) 
Finally, the list 



G,={{1,2),{1,4), (2,3},{2,4), {3,3), {4, 1}} 

describes the same graph with an edge list of type 2. 

A natural question to ask is: Which data structure should be used in a given situation? For small graphs, it really doesn't make much difference, 
but since answers the question "Is there an edge from vertex i to vertex j is easiest with an adjacency matrix, the adjacency matrix would be the 
natural choice. For larger matrices the edge count would be a consideration. If n is large and the number of edges is relatively small, it might 
use less memory to maintain a list of edges instead of building ainxn matrix. Most software for working with graphs will make the decision 
for you. 

Mathematica Note 

First, a short history of graphs in Mathematica. Until Mathematica 6.0, all graph related functions were part of a package called Combinator- 
ica. A few graph related functions, notably GraphPlot, were introduced in version 6.0. The output from GraphPlot was (and still is) a 
Graphics object. Starting in Mathematica 8.0, graphs became native Mathematica objects. Graph expressions are backwards compatible in that 
Combinatorica is still available. In addition, GraphPlot is still a good way to fine-tune the way a graph appears. 

The graph in Example 9.2.1 was drawn by evaluating the following expression. 
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d = GraphPlot [ { 1 -» 2 , 2 -» 3 , 2 -» 4 , 4 -» 1 , 1 -> 4 , 3-»3}, 
VertexLabeling -» True, DirectedEdges -> True] 





The same graph, with a different embedding and style of displaying vertices can be produced using Graph. 

Gi = Graph [{1, 2, 3, 4}, {1 -» 2, 2 -> 3, 2 -» 4, 4 -» 1, 1 -» 4, 3 -» 3}, 
VertexLabels -» "Name", ImagePadding -» 5] 





From both expressions one sees that an edge list in the form of Mathematica rules is used to specify the edges. Using GraphPlot, the 
vertices are implicit from the numbers that appear among the edges . This has the drawback of not allowing for an isolated vertex without any 
edges connected to it. A second acceptable way to specify edges is using GraphPlot is with an adjacency matrix: 



G 2 = 




10 
10 
10 
10 1 



1 ^ 





I 



GraphPlot [G 2 , VertexLabeling -> True, DirectedEdges -» True] 




Notice that in these examples, the placement of the vertices is determined by Mathematica. Both functions have options that do allow for 
placement. 

An example of a system graph function is GraphDistance. For example, we can ask for the distance from vertex 1 to vertex 3 in G\ and 
find that one needs to travel along two edges to go from 1 to 3. 

GraphDistance [Gi, 1,3] 



In G[ , you can't get from vertex 3 to vertex 4, so the distance is infinite: 
GraphDistance [Gi, 3,4] 
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We can get the adjacency matrix of Gi or the edge list of Gi with package function 

Ad j acencyMatr ix [Gi ] 

SparseArray[<6>, {4, 4}] 

In practice, graphs are often quite large, so Mathematica automatically uses a sparse array data structure to store the adjacency matrix. To see 
the actual matrix you can use the Normal function: 

Normal [Adj acencyMatr ix [G x ] ] 

(0 1 1\ 

11 
10 

U oo o| 

Larger graphs can be generated by creating lists of edges using rules. For example, here is graph of all divisors of 6 != 720 with an edge 

j 72 

connecting j to k if - is a prime. For example, there is an edge connecting 72 to 24 because — = 3 is prime. The code for creating G 3 may 
look daunting, but essentially the set of ordered pairs in the set 

Divisors[6!] = {1, 2, 3, 4, 5, 6, 8, 9, 10, 12, 15, 16, 18, 20, 24, 30, 36, 40, 45, 48, 60, 72, 80, 90, 120, 144, 180, 240, 360, 720) 

and the relatively few ordered pairs where the first number divided by the second number is prime are selected out. These ordered pairs are 
then converted to a rule: (a, b) is converted to a -» b. 

edges = Map [Rule @@tt &, Select [Tuples [Divisors [6 !] , 2], PrimeQ [Divide @@n] &] ] ; 
G 3 = Graph [edges, VertexLabels -» "Name", ImagePadding -» 5] 




The adjacency matrix of G3 can certainly be considered sparse since it has relatively few l's. 
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AdjacencyMatrix [G 3 ] //Normal 
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pu| Sage Note 

Sage graphs are specified using version 1 of edge lists. Here is how the graph in Example 9.2.1 is generated and then displayed. 



sage : Gl 
Gl .show( ) 



DiGraph( { 1 : [4, 2], 2 : [3, 4], 3 : [3], 4 



[I]}! 




There are many special graphs and graph families that are available in Sage through the graphs module. They are referenced with the prefix 
graphs . followed by the name and zero or more paramenters inside parentheses. Here are a couple of them, first a complete graph with five 
vertices . 
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sage : graphs .CompleteGraph( 5) .show( ' 




Here is a wheel graph, named for an obvious pattern of vertices and edges. We assign a name to it first and then show the graph without 
labeling the vertices . 

sage: w=graphs .WheelGraph( 20) 
sage : w. show( vertex_labels=f alse ) 




There are dozens of graph methods, one of which determines the degree sequence of a graph. 



sage: w.degree_sequence( ) 

I <L> i "/ ^ f ^ f ^ f "f "/ ^ f ^ f ^ 9 ^ f f *^f ^ 9 ^ f ^ f *^/ 9 *^/ J 



The degree sequence method is defined within the graphs module, but the prefix graphs . isn't needed because the value of w inherits the 
graphs methods. 
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Example 9.2.2. Consider an directed graph represented by the Division I NCAA college basketball teams in the United States for a 
given year. There are approximately 350 teams in Division 1 . Suppose we constructed the graph with an edge from team A to team B if A 
beat B at least once in the season; and we label the edge with the number of wins. Since the average team plays around 30 games in a 

season, most of which will be against other Division I teams, we could expect around : — = 5250 edges in the graph (this is somewhat 

reduced by games with lower division teams and cases where two or more wins over the same team produces one edge). Since 5250 is much 
smaller than 350 2 = 122500 entries in an adjacency matrix, we would consider this a sparse graph and would expect to find one of the edge 
list structures most efficient. Even if an adjacency matrix is created, as could be done in Mathematical, it would be represented using a sparse 
matrix data structure. 

EXERCISES FOR SECTION 9.2 
A Exercises 

1 . Estimate the number of vertices and edges in each of the following graphs. Would the graph be considered sparse? 

(a) Vertices: Cities of the world that are served by at least one airline. 
Edges: Pairs of cities that are connected by a regular direct flight. 

(b) Vertices: ASCII characters. 

Edges: connect characters that differ in their binary code by exactly two bits. 

(c) Vertices: All English words. 

Edges: An edge connects word x to word y if x is a prefix of y . 

2. Each edge of a graph is colored with one of the four colors red, blue, yellow, or green. How could you represent the edges in this graph 
using a variation of the adjacency matrix structure? 

3. Directed graphs G\ , . . . , Gs , each with vertex set { 1 , 2, 3, 4, 5) are represented by the matrices below. Which graphs are isomorphic? 



/0 1 000 

10 

10 

1 

U oooo 

(01111 





10 

10 



G 2 : 



A 

10 



1110 1 



(0 1 

10 10 
1 

10 1 



Gy. 





10 1 

10 

10 

1 J 

(0 1 

110 
10 

10 1 ) 



4. The following Sage command verifies that the wheel graph with four vertices is isomorphic to the complete graph with four vertices. 

graphs .WheelGraph( 4) . is_isomorphic( graphs .CompleteGraph( 4) ) 
Evaluate the expression dir ( graphs . ) in Sage to get a list of graph names, and then find two more pairs of isomorphic graphs. 
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I 9.3 Connectivity 



This section is devoted to a question that, when posed in relation to the graphs that we have examined, seems trivial. That question is: Given 
two vertices, s and t, of a graph, is there a path from s to t? If s = t, this question is interpreted as asking whether there is a circuit of positive 
length starting at s. Of course, for the graphs we have seen up to now, this question can be answered after a brief examination. 

There are two situations under which a question of this kind is nontrivial. One is where the graph is very large and an "examination" of the 
graph could take a considerable amount of time. Anyone who has tried to solve a maze may have run into a similar problem. The second 
interesting situation is when we want to pose the question to a machine. If only the information on the edges between the vertices is part of the 
data structure for the graph, how can you put that information together to determine whether two vertices can be connected by a path? 

Connectivity Terminology. Let v and w be vertices of a directed graph. Vertex v is connected to vertex w if there is a path from v to w. Two 
vertices are strongly connected if they are connected in both directions to one another. A graph is connected if, for each pair of distinct vertices, 
v and w, v is connected to w or w is connected to v. A graph is strongly connected if every pair of its vertices is strongly connected. For an 
undirected graph, in which edges can be used in either direction, the notions of strongly connected and connected are the same. 

Theorem 93.1. If a graph has n vertices and vertex u is connected to vertex w, then there exists a path from u to w of length no more 
than n. 

Proof (Indirect): Suppose u is connected to w, but the shortest path from atow has length m, where m> n. A vertex list for a path of 
length m will have m + 1 vertices. This path can be represented as (vo, vi, . . ., v m ), where vo = u and v m = w. Note that since there are only 
n vertices in the graph and m vertices are listed in the path after Vo, we can apply the pigeonhole principle and be assured that there must be 
some duplication in the last m vertices of the vertex list, which represents a circuit in the path. This means that our path of minimum length 
can be reduced, which is a contradiction. ■ 

Methods for Testing Connectivity 
Questions: 

Question 1: Given a graph and two vertices in the graph, is there a path from the first vertex to the second? 

Question 2: If the answer to Question 1 is "yes" then what is the path? 

Some Answers 

Method 1: Adjacency Matrix Method. Suppose that the information about edges in a graph is stored in an adjacency matrix, G. The relation, 
r, that G defines is v r w if there is an edge connecting v to w. Recall that the composition of r with itself, r 2 , is defined by v r 2 w if there exists a 
vertex y such that vry and yrw; that is, v is connected to w by a path of length 2. We could prove by induction that the relation r k , k> 1, is 

defined by vr k w if and only if there is a path of length k from v to w. Since the transitive closure, r + , is the union of r, r 2 , r 3 , ..., we can 
answer our connectivity question by determining the transitive closure of r, which can be done most easily by keeping our relation in matrix 
form. Theorem 9.3.1 is significant in our calculations because it tells us that we need only go as far as E" to determine the matrix of the 
transitive closure. 

The main advantage of the adjacency matrix method is that the transitive closure matrix can answer all questions about the existence of paths 
between vertices. If G + is the matrix of the transitive closure, v,, is connected to v, if (E + ), ; - = 1 . A directed graph is connected if (E + ) t j = 1 or 

(E + )ji = 1 for each i + j. A directed graph is strongly connected if its transitive closure matrix has no zeros. 

A disadvantage of the adjacency matrix method is that the transitive closure matrix tells us whether a path exists, but not what the path is. 

Method 2: Broadcasting. We will describe this method first with an example. 

Example 9.3.1. The football team at Mediocre State University (MSU) has had a bad year, 2 wins and 9 losses. Thirty days after the end of the 
football season, the university trustees is meeting to decide whether to rehire the head coach; things look bad for him. However, on the day of 
the meeting, the coach releases the following list of results from the past year: 

Mediocre State defeated Local A&M. 

Local A&M defeated City College. 

City College defeated Corn State U. 

... (25 results later) 

Tough Tech defeated Enormous State University (ESU). 

And ESU went on to win the national championship! 

The trustees were so impressed that they rehired the coach with a raise in pay! How did the coach come up with such a list? 

In reality, such lists exist occasionally and appear in newspapers from time to time. Of course they really don't prove anything since each team 
that defeated MSU in our example above can produce a similar chain of results. Since college football records are readily available, the coach 
could have found this list by trial and error. All that he needed to start with was that his team won at least one game. Since ESU lost one game, 
there was some hope of producing the chain. 

The problem of finding this list is equivalent to finding a path in the tournament graph for last year's football season that initiates at MSU and 
ends at ESU. Such a graph is far from complete and would be represented using edge lists. To make the coach's problem interesting, let's 
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imagine that only the winner of a game remembers the result of the game. The coach's problem has now taken on the flavor of a maze. To reach 
ESU, he must communicate with the various teams along the path. One way that the coach could have discovered his list in time is by sending 
the following emails to the coaches of the two teams that MSU defeated during the season: 

When this example was first written, we commented that ties should be ignored. Most recent NCAA rules call for a tiebreaker in college football and so ties are no 
longer an issue. Email was also not common and we described the process in terms of letter, not email messages. The coach could also have asked the MSU math 
department to use Mathematics or Sage to get the path! 

Dear Football Coach: 

Please follow these directions exactly. 

(1) If you are the coach at ESU, call the coach at MSU now and tell him who sent 
you this message. 

(2) If you are not the coach at ESU and this is the first message of this type that 
you have received, then: 

(a) Remember who you received this message from. 

(b) Forward a copy of this message, signed by you, to each of the coaches 
whose teams you defeated during the past year. 

(3) Ignore this message if you have received one like it already. 

Signed, 

Coach of MSU 

Observations: From the conditions of this message, it should be clear that if everyone cooperates and if coaches participate within a day of 
receiving the message: 

(a) If a path of length n exists from MSU to ESU, then the coach will know about it in n days. 

(b) By making a series of phone calls, the coach can obtain the path that he wants by first calling the coach who defeated ESU (the person who 
sent ESU's coach that message). This coach will know who sent him a letter, and so on. Therefore, the vertex list of the desired path is obtained 
in reverse order. 

(c) If a total of M football games were played, no more than M letters will be sent out. 

(d) If a day passes without any letter being sent out, no path from MSU to ESU exists. 

(e) This method could be extended to obtain a list of all teams that a given team can be connected to. Simply imagine a series of letters like the 
one above sent by each football coach and targeted at every other coach. 

The general problem of finding a path between two vertices in a graph, if one exists, can be solved exactly as we solved the problem above. The 
following algorithm is commonly called a breadth-first search. 

Algorithm 9.3.1. A broadcasting algorithm for finding a path between vertex i and vertex j of a graph having n vertices. The each item V k of a 
list V = {V[, V 2 , ..., V„}, consist of a Boolean field V^.found and an integer field V^.from. The sets D\, D 2 , ..., called depth sets, have the 
property that if k is in D r , then the shortest path from vertex i to vertex k is of length r. In Step 5, a stack is used to put the vertex list for the 
path from the vertex i to vertex j in the proper order. 

1. Set the value V k . found equal to False, k = 1, 2, ... , n 

2. r = 

3. D = {i} 

4. While (-> V-j. found) and (D r #0) 

4 . 1 D r+1 = 

4.2 For each k in D r do 

For each edge (k,t) do 
If V t . found == False 
then V t . found = True 
V t .from = k 
£ r+ l = -D r+1 (J {t} 

4.3 r = r + 1 

5. If Vj. found = True Then 

5.1 S = Empty Stack 

5.2 k=j 

5.3 While V^-from 4 i 

5.3.1 Push k onto S 

5.3.2 k = V^.from 

Notes on Algorithm 9.3.1: 

(a) This algorithm will produce one path from vertex i to vertex j, if one exists, and that path will be as short as possible. If more than one path 
of this length exists, then the one that is produced depends on the order in which the edges are examined and the order in which the elements of 
D r are examined in Step 4. 

(b) The condition D r + is analogous to the condition that if no mail is sent in a given stage of the process, in which case MSU cannot be 
connected to ESU. 

(c) This algorithm can be easily revised to find paths to all vertices that can be reached from vertex i. Step 5 would be put off until a specific 
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path to a vertex is needed since the information in V contains an efficient list of all paths. The algorithm can also be extended further to find 
paths between any two vertices . 

Example 9.3.2. Consider the graph in Figure 9.3.1. The existence of a path from vertex 2 to vertex 3 is not difficult to determine by 
examination. After a few seconds, you should be able to find two paths of length four. Algorithm 9.3.1 will produce one of them. 




Figure 9.3.1 

Suppose that the edges from each vertex are sorted in ascending order by terminal vertex. For example, the edges from vertex 3 would be in 
the order (3, 1), (3, 4), (3, 5). In addition, assume that in the body of Step 4 of the algorithm, the elements of D, are used in ascending 
order. Then at the end of Step 4, the value of V will be 



k 1 


2 


3 


4 


5 


6 


V k .found T 


T 


T 


T 


T 


T 


V^from 2 


4 


6 


1 


1 


4 


Depth set 1 


3 


4 


2 


2 


3 



(value of r for which k e D r ) 

Therefore, the path (2, 1, 4, 6, 3) is produced by the algorithm. Note that if we wanted a path from 2 to 5, the information in V produces the 
path (2, 1, 5) since V^.from = 1 and Vj.from = 2. A shortest circuit that initiates at vertex 2 is also available by noting that V 2 -from = 4, 
V 4 .from = 1 , and V[.from = 2; thus the circuit (2, 1, 4, 2) is obtained. 

jflfr Mathematics Note 

Consider the following defined by a adjecency matrix. 

SeedRandom[2014] ; 

adj = Randomlnteger [{0, 1} , {8, 8}] * Table [Boole [i ^ j] , {i, 1, 8}, {j, 1, 8}] 

fO 1 10 10 

10 

1 10 10 
10 110 1 

10 10 111 
10 110 10 

1 1 10 
{0 1 1 ) 

g = AdjacencyGraph [adj , VertexLabels -» "Name", ImagePadding -» 5] 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3 .0 United States 



191 



Chapter 9 - Graph Theory 



191 




Is 1 is connected to 8 in this graph? The function FindShortestPath will give us a shortest: 

FindShortestPath[g, 1, 8] 

{1,2,4, 8) 
If we are interested in reaching vertex 8 from all vertices, we can do that just as easily. 

Map[{B, FindShortestPath [g, All, 8] [B] } &, Range[8]] 

(I (1,6,4,8) 

2 (2,4,8) 

3 (3,2,4,8) 

4 (4,8) 

5 (5,8) 

6 (6,4,8) 

7 (7,2,4,8) 

8 (8) 

Finally, here is a matrix of shortest paths between any two vertices in the graph. 

Map [Apply [FindShortestPath [g, All, All], a] &, Outer [List, Range[l, 8], Range[l, 8]], {2}] 

(1,2,4) (1,2,4,5) (1,6) 

(2,4) (2,4,5) (2,4,6) 

(3,2,4) (3,2,4,5) (3, 1,6) 

(4) (4, 5) (4, 6) 

(5,4) (5) (5,6) 

(6,4) (6,4,5) (6) 

(7,2,4) (7,2,4,5) (7, 1,6) 

(8,5,4) (8,5) (8,5,6) 

If no path exists between two vertices, the result is an empty list, as illustrated by this simple example. 

h = Graph [{1 -» 2, 2-»3, 3 -> 4}] ; 
FindShortestPath [h, 4, 1] 



(1) 


(1,2) 


(1,3) 


(2,4,3, 1) 


(2) 


(2,4,3) 


(3,1) 


(3,2) 


(3) 


(4,3,1) 


(4,3,2) 


(4,3) 


(5,7,1) 


(5,2) 


(5,8, 3) 


(6,1) 


(6,1,2) 


(6,3) 


(7,1) 


(7,2) 


(7,3) 


(8,3, 1) 


(8,3,2) 


(8,3) 



(1,6,7) 


(1,2,4, 8) 


2,4, 3,7) 


(2,4,8) 


(3,7) 


(3,2,4, 8) 


(4,3,7) 


(4,8) 


(5,7) 


(5,8) 


(6,7) 


(6,4,8) 


(7) 


(7,2,4, 8) 


(8, 3,7) 


(8) 
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Z#S\ Sage Note 

tSSt 

Here is some sage code that 

1. Generates a random undireced graph with 18 vertices. For each pair of vertices, an edge is included between them with probability 0.2. 
18' 



Since there are =153 potential edges, we expect that there will be approximately 0.2 x 153 « 31 edges. 

2. Counts the number of edges. In this case the number is a bit less than expected. 

3 . Finds a shortest path from vertex to vertex 8 . 

4. Generates a list of vertices that would be reached in a breadth-first search. The expression Gr . depth_f irst_search ( ) creates an 
iterator that is convenient for programming. Wrapping list( ) around the expression shows the order in which the vertices are visited. 

5. Generates a list of vertices that would be reached in a depth-first search. In this type of search you travel in one direction away from the 
starting point until no further new vertices. We will discuss this search later. 

sage : Gr=graphs .RandomGNP( 18,0.2); 
Gr .show( ) 




sage: len(Gr .edges( labels=False) ) 

25 

sage: Gr .shortest_path( 0, 8) 

[0, 10, 14, 8] 

sage : list(Gr .breadth_f irst_search( 0) ) 

[0, 17, 10, 11, 4, 13, 14, 3, 15, 7, 8, 9, 16, 1, 12, 2, 6, 5] 

sage : list(Gr .depth_f irst_search( 0) ) 

[0, 11, 15, 12, 5, 6, 7, 14, 10, 9, 3, 4, 17, 13, 1, 16, 8, 2] 



EXERCISES FOR SECTION 9.3 
A Exercises 

1. Apply Algorithm 9.3.1 to find a path from 5 to 1 in Figure 9.3.1. What would be the final value of V? Assume that the terminal vertices in 
edge lists and elements of the depth sets are put into ascending order, as we assumed in Example 9.3.1 . 

2. Apply Algorithm 9.3.1 to find a path from the bedroom to outside using the edge list data structure in Example 9.2.1. Assume that the 
elements of the depth sets are put into ascending order. 

3. In a simple undirected graph with no self-loops, what is the maximum number of edges you can have, keeping the graph unconnected? 
What is the minimum number of edges that will assure that the graph is connected? 

4. Use a broadcasting algorithm to determine the shortest path from vertex a to vertex i in the graphs shown in Figure 9.3.2. List the depth sets 
and the stack that is obtained. 
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(a) 




(b) 




S 



Figure 9.3.2 

Exercise 4 



B Exercise 

5 . Prove (by induction on k) that if the relation r on vertices of a graph is defined by v r w if there is an edge connecting v to w , then r k ,k > 1 , 
is defined by v r k w if there is a path of length k from vtow. 
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I 9.4 Traversal: Eulerian and Hamiltonian Graphs 



The subject of graph traversals has a long history. In fact, the solution by Leonhard Euler (Switzerland, 1707-83) of the Konigsberg Bridge 
Problem is considered by many to represent the birth of graph theory. 



The Konigsberg Bridge Problem and Eulerian Graphs 



kaWIHCi.t.'HHCA 




Figure 9.4.1 

Map of Konigsberg 

A map of the Prussian city of Konigsberg (circa 1735) in Figure 9.4.1 shows that there were seven bridges connecting the four land masses that 
made up the city. The legend of this problem states that the citizens of Konigsberg searched in vain for a walking tour that passed over each 
bridge exactly once. No one could design such a tour and the search was abruptly abandoned with the publication of Euler's Theorem. 

Theorem 9.4.1: Euler' s Theorem — Konigsberg Case. No walking tour of Konigsberg can be designed so that each bridge is used 
exactly once. 




Figure 9.4.2 

Multigraph representation of Konigsberg 

Proof: The map of Konigsberg can be represented as an undirected multigraph, as in Figure 9.4.2. The four land masses are the vertices 
and each edge represents a bridge. The desired tour is then a path that uses each edge once and only once. Since the path can start and end at 
two different vertices, there are two remaining vertices that must be intermediate vertices in the path. If x is an intermediate vertex, then every 
time that you visit x, you must use two of its incident edges, one to enter and one to exit. Therefore, there must be an even number of edges 
connecting x to the other vertices . Since every vertex in the Konigsberg graph has an odd number of edges , no tour of the type that is desired 
is possible. ■ 

As is typical of most mathematicians, Euler wasn't satisfied with solving only the Konigsberg problem. His original theorem, which is para- 
phrased below, concerned the existence of paths and circuits like those sought in Konigsberg. These paths and circuits have become associated 
with Euler's name. 

Definitions: Eulerian Paths, Circuits, Graphs. A Eulerian path through a graph is a path whose edge list contains each edge of the 
graph exactly once. If the path is a circuit, then it is called a Eulerian circuit. A Eulerian graph is a graph that possesses a Eulerian path. 

Example 9.4.1. Without tracing any paths, we can be sure that the graph below has an Eulerian circuit because all vertices have an even 
degree. This follows from the following theorem. 
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Theorem 9.4.2: Euler's Theorem — General Case. An undirected graph is Eulerian if and only if it is connected and has either zero or 
two vertices with an odd degree. If no vertex has an odd degree, then the graph has a Eulerian circuit. 

Proof: It can be proven by induction that the number of vertices in an undirected graph that have an odd degree must be even. We will leave the 
proof of this fact to the reader as an exercise. The necessity of having either zero or two vertices of odd degree is clear from the proof of the 
Konigsberg case of this theorem. Therefore, we will concentrate on proving that this condition is sufficient to ensure that a graph is Eulerian. 
Let k be the number of vertices with odd degree. 

Phase 1. If k = 0, start at any vertex, Vn, and travel along any path, not using any edge twice. Since each vertex has an even degree, this path 
can always be continued past each vertex that you reach except vq. The result is a circuit that includes Vo. If k = 2, let vo be either one of the 
vertices of odd degree. Trace any path starting at Vo using up edges until you can go no further, as in the k = case. This time, the path that 
you obtain must end at the other vertex of odd degree that we will call V] . At the end of Phase 1 , we have an initial path that may or may not 
be Eulerian. If it is not Eulerian, Phase 2 can be repeated until all of the edges have been used. Since the number of unused edges is decreased 
in any use of Phase 2, a Eulerian path must be obtained in a finite number of steps. 
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Phase 2. As we enter this phase, we have constructed a path that uses a proper subset of the edges in our graph. We will refer to this path as the 
current path. Let V be the vertices of our graph, E the edges, and E„ the edges that have been used in the current path. Consider the graph 
G' = (V ', E — E u ). Note that every vertex in G' has an even degree. Select any edge, e, from G'. Let v a and v b be the vertices that e connects. 
Trace a new path starting at v a whose first edge is e. We can be sure that at least one vertex of the new path is also in the current path since 
(V, E) is connected. Starting at v a , there exists a path in (V ', E) to any vertex in the current path. At some point along this path, which we can 
consider the start of the new path, we will have intersected the current path. Since the degree of each vertex in G' is even, any path that we start 
at v a can be continued until it is a circuit. Now, we simply augment the current path with this circuit. As we travel along the current path, the 
first time that we intersect the new path, we travel along it (see Figure 9.4.3). Once we complete the circuit that is the new path, we resume the 
traversal of the current path. 



1 i •- 




CurraH path 

New path 

New cuiTenl path 



Figure 9.4.3 

Augmenting the current path in the proof of Theorem 9.4.2 

If the result of this phase is a Eulerian path, then we are finished; otherwise, repeat this phase. # 

Example 9.4.2. The complete undirected graphs K 2 a.ndK 2n+ i,n = 1, 2, 3, , are Eulerian. If n > 1, then K 2 „ is not Eulerian. 

HAMILTONIAN GRAPHS 

To search for a path that uses every vertex of a graph exactly once seems to be a natural next problem after you have considered Eulerian 
graphs. The Irish mathematician Sir William Hamilton (1805-65) is given credit for first defining such paths. He is also credited with discover- 
ing the quaternions, for which he was honored by the Irish government with a postage stamp in 2004. 




Definition: Hamiltonian Paths, Circuits, and Graphs. A Hamiltonian path through a graph is a path whose vertex list contains each 
vertex of the graph exactly once, except if the path is a circuit, in which case the initial vertex appears a second time as the terminal vertex. If 
the path is a circuit, then it is called a Hamiltonian circuit. A Hamiltonian graph is a graph that possesses a Hamiltonian path. 

Example 9.4.3. Figure 9.4.4 shows a graph that is Hamiltonian. In fact, it is the graph that Hamilton used as an example to pose the 
question of existence of Hamiltonian paths in 1859. In its original form, the puzzle that was posed to readers was called 'Around the World." 
The vertices were labeled with names of major cities of the world and the object was to complete a tour of these cities. The graph is also 
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referred to as the dodecahedron graph, where vertices correspond with the corners of a dodecahedron and the edges are the edges of the solid 
that connect the corners . 




Figure 9.4.4 

The dodecahedron graph, a Hamiltonian graph. 




Figure 9.4.5 

The regular dodecahedron 

Unfortunately, a simple condition doesn't exist that characterizes a Hamiltonian graph. An obvious necessary condition is that the graph be 
connected; however, there is a connected undirected graph with four vertices that is not Hamiltonian. Can you draw such a graph? 

A Note on What Is Possible and What Is Impossible. The search for a Hamiltonian path in a graph is typical of many simple-sounding 
problems in graph theory that have proven to be very difficult to solve. Although there are simple algorithms for conducting the search, they are 
impractical for large problems because they take such a long time to complete as graph size increases. Currently, every algorithm to search for a 
Hamiltonian path in a graph takes exponential time to complete. That is, if T (n) is the time it takes to search a graph of n vertices, then there is 
a positive real number a, a > 1 , such that T (n) > a" for all but possibly a finite number of positive values for n. No matter how close to 1 we 
can make a, a" will grow at such a fast rate that the algorithm will not be feasible for large values of n. For a given algorithm, the value of a 
depends on the relative times that are assigned to the steps, but in the search for Hamiltonian paths, the actual execution time for known 
algorithms is large with 20 vertices. For 1,000 vertices, no algorithm is likely to be practical, and for 10,000 vertices, no currently known 
algorithm could be executed. 

It is an unproven but widely held belief that no faster algorithm exists to search for Hamiltonian paths . A faster algorithm would have to be one 
that takes only polynomial time; that is, T (n) < p(n), for some polynomial sequence p. To sum up, the problem of determining whether a 
graph is Hamiltonian is theoretically possible; however, for large graphs we consider it a practical impossibility. Many of the problems we will 
discuss in the next section, particularly the Traveling Salesman Problem, are thought to be impossible in the same sense. 

Definition: The n-cube. Let n > 1, and let B" be the set of strings ofO's and 1 's with length n. The n-cube is the undirected graph with 
a vertex for each string in B" and an edge connecting each pair of strings that differ in exactly one position. 

The 1-cube, 2-cube, 3-cube, and 4-cube are shown in Figure 9.4.6. 
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Figure 9.4.6 

n-cubes, n = 1, 2, 3, 4 

The Gray Code. A Hamiltonian circuit of the n-cube can be described recursively. The circuit itself, called the Gray Code, is not the only 
Hamiltonian circuit of the n-cube, but it is the easiest to describe. The standard way to write the Gray Code is as a column of strings, where the 
last string is followed by the first string to complete the circuit. 

Basis (n = 1): The Gray Code for the 1-cube is Gi = I . Note that the edge between and 1 is used twice in this circuit. That doesn't 

violate any rules for Hamiltonian circuits , but can only happen if a graph as two vertices . 

Recursion: Given the Gray Code for the rc-cube, n > 1, then G n+ \ is obtained by (1) listing G„ with each string prefixed with 0, and then (2) 
reversing the list of strings in G„ with each string prefixed with 1 . Symbolically, the recursion can be expressed as 

_(0G„ 
G " +1 - { 1 G„< 

where G,,' is the reverse of list G„. The Gray Codes for the 2-cube and 3-cube are 



C 2 



Applications of the Gray Code. One application of the Gray code was discussed in the Introduction to this book. An other application is in 
statistics. In a statistical analysis, there is often a variable that depends on several factors, but exactly which factors are significant may not be 
obvious. For each subset of factors, there would be certain quantities to be calculated. One such quantity is the multiple correlation coefficient 
for a subset. If the correlation coefficient for a given subset, A, is known, then the value for any subset that is obtained by either deleting or 
adding an element to A can be obtained quickly. To calculate the correlation coefficient for each set, we simply travel along G„, where n is the 
number of factors being studied. The first vertex will always be the string of 0's, which represents the empty set. For each vertex that you visit, 

the set that it corresponds to contains the k th factor if the k th character is a 1 . 
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EXERCISES FOR SECTION 9.4 
A Exercises 

1 . Locate a map of New York City and draw a graph that represents its land masses, bridges and tunnels. Is there a Eulerian path through New 
York City? You can do the same with any other city that has at least two land masses. 

2. Which of the drawings in Figure 9.4.7 can be drawn without removing your pencil from the paper and without drawing any line twice? 




Figure 9.4.7 

Exercise 2 

3 . Write out the Gray Code for the 4-cube. 

4. Find a Hamiltonian circuit for the dodecahedron graph in Figure 9.4.4. 

5 . The Euler Construction Company has been contracted to construct an extra bridge in Konigsberg so that a Eulerian path through the town 
exists. Can this be done, and if so, where should the bridge be built? 

6. (a) Determine which of the graphs in Figure 9.4.8 has a Eulerian path? 
(b) Find a Eulerian path for the graphs that have one. 
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Figure 9.4.8 

Exercise 6 

B Exercises 

7. Formulate Euler's theorem for directed graphs. 

8. Prove that the number of vertices in an undirected graph with odd degree must be even. (Hint: Prove by induction on the number of edges.) 

9. (a) Under what conditions will a round-robin tournament graph be Eulerian? 
(b) Prove that every round-robin tournament graph is Hamiltonian. 

10. For what values of n is rc-cube Eulerian. 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3 .0 United States 



Chapter 9 - Graph Theory 



I 9.5 Graph Optimization 



The common thread that connects all of the problems in this section is the desire to optimize (maximize or minimize) a quantity that is associ- 
ated with a graph. We will concentrate most of our attention on two of these problems, the Traveling Salesman Problem and the Maximum 
Flow Problem. At the close of this section, we will discuss some other common optimization problems. 

Definition: Weighted Graph. A weighted graph, (V , E, w), is a graph (V, E) together with a weight function w : E -» R. If e e E, 
w (e) is the weight on edge e. 

As you will see in our examples, w (e) is usually a cost associated with the edge e; therefore, most weights will be positive. 

Example 9.5.1. Let V be the set of six capital cities in New England: Boston, Augusta, Hartford, Providence, Concord, and Montpelier. Let E 
be {{a, b\ e V X V \ a + b}\ that is, (V, E) is a complete unordered graph. An example of a weight function on this graph is 

w(c\, c 2) = the distance from Ci to C2 • 
Many road maps define distance functions as in Figure 9.5.1. 

ME MA NH CT VT RI 

Augusta, ME - 165 148 266 190 208 

Boston, MA 165 - 75 103 192 43 

Concord, NH 148 75 - 142 117 109 

Hartford, CT 266 103 142 - 204 70 

Montpelier, VT 190 192 117 204 - 223 

Providence, RI 208 43 109 70 223 - 

FIGURE 9.5.1 

Distances between capital cities of New England 

The Traveling Salesman Problem 

The Traveling Salesman Problem is, given a weighted graph, to find a circuit (ei, e 2 , •••» e n) that visits every vertex at least once and mini- 
mizes the sum of the weights, 

n 

E Med 
Any such circuit is called an optimal path. 

Notes, (a) Some statements of the Traveling Salesman Problem require that the circuit be Hamiltonian. In many applications, the graph 
in question will be complete and this restriction presents no problem. 

(b) If the weight on each edge is constant, for example, w (e) = 1 , then the solution to the Traveling Salesman Problem will be any Hamilto- 
nian circuit, if one exists. 

Example 9.5.2. The Traveling Salesman Problem gets its name from the situation of a salesman who wants to minimize the number of 
miles that he travels in visiting his customers. For example, if a salesman from Boston must visit the other capital cities of New England, then 
the problem is to find a circuit in the weighted graph of Example 9.5.1. Note that distance and cost are clearly related in this case. In 
addition, tolls and traffic congestion might also be taken into account in this case. 

The search for an efficient algorithm that solves the Traveling Salesman has occupied researchers for years. If the graph in question is 
complete, there are (n - 1) ! different circuits. As n gets large, it is impossible to check every possible circuit. The most efficient algorithms for 
solving the Traveling Salesman Problem take an amount of time that is proportional to n2". Since this quantity grows so quickly, we can't 
expect to have the time to solve the Traveling Salesman Problem for large values of n. Most of the useful algorithms that have been developed 
have to be heuristic; that is, they find a circuit that should be close to the optimal one. One such algorithm is the "closest neighbor" algorithm, 
one of the earliest attempts at solving the Traveling Salesman Problem. The general idea behind this algorithm is, starting at any vertex, to visit 
the closest neighbor to the starting point. At each vertex, the next vertex that is visited is the closest one that has not been reached. This 
shortsighted approach typifies heuristic algorithms called greedy algorithms, which attempt to solve a minimization (maximization) problem by 
minimizing (maximizing) the quantity associated with only the first step. 

Algorithm 9.5.1. The Closest Neighbor Algorithm. Let G = (V, E, w) be a complete weighted graph with \ V | = n. The closest 
neighbor circuit through G starting at V\ is (v h v 2 , . . ., v„), defined by the steps: 

1. Vj = V-{v x ). 

2 . For k = 2 to n — 1 

2.1 v k = the closest vertex in V k _j to v k _j 

(* w ( v k-] • v k)= min ( w ( v k-l ,V) \ V £ V k _,) *) 

In case of a tie for closest, v k may be chosen arbitrarily . 

2.2 V k = V k _, -{v k } 

3. v n = the only element of V „ . 

The cost of the closest neighbor circuit is 

n-[ 

Z w(v k , V k+l ) + w(v„, V,) 

k=l 

Example 9.5.3. The closest neighbor circuit starting at A in Figure 9.5.2 is (1, 3, 2, 4, 1), with a cost of 29. The optimal path is 
(1, 2, 3,4, 1), with a cost of 27. 
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FIGURE 9.5.2 

Example 9.5.3 



Although the closest neighbor circuit is often not optimal, we may be satisfied if it is close to optimal. If C opt and C cn are the costs of optimal 

... ... Cc 

and closest neighbor circuits in a graph, then it is always the case that C opt < C cn or — > 1. We can assess how good the closest neighbor 

c 
algorithm is by determining how small the quantity - 2 - gets. If it is always near 1 , then the algorithm is good. However, if there are graphs for 

CoDt 



1.074. A 7% increase in cost may or may not 



Example 9.5.4. A salesman must make stops at vertices A, B, and C, which are all on the same one-way street. The graph in Figure 9.5.3 
is weighted by the function 



which it is large, then the algorithm may be discarded. Note that in Example 9.5.3, 

be considered significant, depending on the situation. 

ample 9.5.4. A salesman must make stops at i 
nted by the function 

w ('< j) — the time it takes to drive from vertex /to vertex/ 
at if j is down the one-way street from i, then w (i, j) < w 
C cn is 32 by using the closest neighbor algorithm. The value of 



Note that if j is down the one-way street from i, then w (;', j) < w (j. i). The values of C opl , and C cn are 20 and 32, respectively. Verify that 
more time on the road if he used the closest neighbor algorithm. 



1.6 is significant in this case since our salesman would spend 60% 




FIGURE 9.5.3 

Example 9.5.4 

A more general result relating to the closest neighbor algorithm presumes that the graph in question is complete and that the weight function 
satisfies the conditions 

(1) w(x, v) = w(y, x) for all x, y in the vertex set, and 

(2) w (x, y) + w(y, z) a w (x, z) for all x, y, z in the vertex set. 

The first condition is called the symmetry condition and the second is the triangle inequality. 

The following theorem's reference needs to be updated: 

Theorem 9.5.1 . If(V, E, w) is a complete weighted graph that satisfies the symmetry and triangle inequality conditions, then 
^-<^ (9.5a) 
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Proof: See Liu, pages 105-109. 
Notes: (a) If | V \ = 8, then this theorem says that C c „ can be no larger than twice the size of C opt ', however, it doesn't say that the closest 

flog (2 m)1 c 

neighbor circuit will necessarily be that far from an optimal circuit. The quantity — is called an upper bound for the ratio — — . It tells us 

2 C op t 

only that things can't be any worse than the upper bound. Certainly, there are many graphs with eight vertices such that the optimal and closest 
neighbor circuits are the same. What is left unstated in this theorem is whether there are graphs for which the quantities in 9.5a are equal. If 
there are such graphs, we say that the upper bound is sharp. 

(b) The value of — — in Example 9.5.4 is 1.6, which is greater than — - = 1.5; however, the weight function in this example does not 

Copt 2 

satisfy the conditions of the theorem. 

The Traveling Salesman Problem— Unit Square Version 

Example 9.5.5. A robot is programmed to weld joints on square metal plates. Each plate must be welded at prescribed points on the square. To 
minimize the time it takes to complete the job, the total distance that a robot's arm moves should be minimized. Let d(P, Q) be the distance 
between P and Q. Assume that before each plate can be welded, the arm must be positioned at a certain point Pn . Given a list of n points, we 
want to put them in order so that 

d(P Q ,P,) + d(P l ,P 2 ) +-+d{P n - U Pn)+d{P n ,P ) 

is as small as possible. 

The type of problem that is outlined in Example 9.5.5 is of such importance that it is probably the most studied version of the Traveling 
Salesman Problem. What follows is the usual statement of the problem. Let [0, 1J = {x e IR | <x< 1}, and let S = [0, 1J 2 , the unit 
square. Given n pairs of real numbers (x\ , y{), (x 2 , y 2 ), ■ ■ ■ > (x n , Jn) in S that represent the n vertices of a K n , find a circuit of the graph that 
minimizes the sum of the distances traveled in traversing the circuit. 

Since the problem calls for a circuit, it doesn't matter which vertex we start at; assume that we will start at (xi, y\). Once the problem is solved, 
we can always change our starting position. A function can most efficiently describe a circuit in this problem. Every bijection 
/: {1, ..., n) -> {1, ..., n)with/(l) = 1 describes a circuit 

(xu yi),(x m ,y m ), ...,(x f(n) ,y m ) 

Since there are (n - 1) ! such bijections, an examination of all possible circuits is not feasible for large values of n. 

One popular heuristic algorithm is the strip algorithm: 

Algorithm 9.5.2: The Strip Algorithm. Given n points in the unit square: 
Phase 1 : 

(1.1) Divide the square into V n/2 vertical strips, as in Figure 9.5.4. Let d be the width of each strip. If a point lies on a boundary between 
two strips, consider it part of the left-hand strip. 

(1.2) Starting from the left, find the first strip that contains one of the points. Locate the starting point by selecting the first point that is 
encountered in that strip as you travel from bottom to top. We will assume that the first point is (x\, ji) 

(13) Alternate traveling up and down the strips that contain vertices until all of the vertices have been reached. 
(1 .4) Return to the starting point . 
Phase 2: 

(2.1) Shift all strips d/2 units to the right (creating a small strip on the left). 

(2.2) Repeat Steps 1 .2 through 1 .4 of Phase 1 with the new strips. 

When the two phases are complete, choose the shorter of the two circuits obtained. 

Step 1.3 needs a bit more explanation. How do you travel up or down a strip? In most cases, the vertices in a strip will be vertically distributed 
so that the order in which they are visited is obvious. In some cases, however, the order might not be clear, as in the third strip in Phase I of 

Figure 9.5.4. Within a strip, the order in which you visit the points (if you are going up the strip) is determined thusly: (x,-, yi) precedes \Xj, j ; J 

if yt < yj or if j, = j ; - and x, < x ; - . In traveling down a strip, replace j, < yj with j, > yj. 
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The Strip Algorithm 
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The selection of yn/2 strips was made in a 1959 paper by Beardwood, Halton, and Hammersley. It balances the problems that arise if the 

number of strips is too small or too large. If the square is divided into too few strips, some strips may be packed with vertices so that visiting 
them would require excessive horizontal motion. If too many strips are used, excessive vertical motion tends to be the result. An update on what 
is known about this algorithm is contained in the paper by K. J. Supowit, E. M. Reingold, and D. A. Plaisted. 

Since the construction of a circuit in the square consists of sorting the given points, it should come as no surprise that the strip algorithm 
requires a time that is roughly a multiple of n log n time units when n points are to be visited. 



The worst case that has been encountered with this algorithm is one in which the circuit obtained has a total distance of approximately 
(see Sopowit et al.). 



'in 



NETWORKS AND THE MAXIMUM FLOW PROBLEM 

Definition: Network. A network is a simple weighted directed graph that contains two distinguished vertices called the source and the 
sink with the property that the indegree of the source and outdegree of the sink are both zero. The weight function o homell2 

n a network is the capacity function . 

An example of a real situation that can be represented by a network is a city's water system. A reservoir would be the source, while a distribu- 
tion point in the city to all of the users would be the sink. The system of pumps and pipes that carries the water from source to sink makes up 
the remaining network. We can assume that the water that passes through a pipe in one minute is controlled by a pump and the maximum rate is 
determined by the size of the pipe and the strength of the pump. This maximum rate of flow through a pipe is called its capacity and is the 
information that the weight function of a network contains . 

Example 9.5.6. Consider the system that is illustrated in Figure 9.5.5. The numbers that appear next to each pipe indicate the capacity of 
that pipe in thousands of gallons per minute. This map can be drawn in the form of a network, as in Figure 9.5 .6. 




FIGURE 9.5.5 

Diagram of a city's water system 

Although the material passing through this network is water, networks can also represent the flow of other materials, such as automobiles, 
electricity, telephone calls or patients in a health system. 
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FIGURE 9.5.6 

Flow diagram for a city's water system 

The Maximum Flow Problem is derived from the objective of moving the maximum amount of water or other material from the source to the 
sink. To measure this amount, we define a flow as a function / : E -» IR such that (1) the flow of material through any edge is nonnegative and 
no larger than its capacity: < f(e) < w (e), for all e e E; and (2) for each vertex other than the source and sink, the total amount of material 
that is directed into a vertex is equal to the total amount that is directed out: 



Z /(*.v) 

(x,v)eB 
Flow into v 



Z f(y,y) 

0,y)e£ 

Flow out of v 



(9.5b) 



The summation notation on the left of 9.5b represents the sum of the flows through each edge in E that has v as a terminal vertex. The right- 
hand side indicates that you should add all of the flows through edges that initiate at v. 

Theorem 9.5.2. Iff is a flow, then 



2 f (source, v) 



■ Z /(v, sink) 
(v,sink)eE 



This common value is called the value of the flow. We will denote the value of a flow by V (/). The value of a flow represents the amount of 
material that passes through the network with that flow. 

Proof. Subtract the right-hand side of 9.5b from the left-hand side. The result is: 



Flow into v - Flow out of v = 
Now sum up these differences for each vertex in V = V 



Z Z /C*,v)- Z f(y,y) 

veV \(x,v)<eE (v.y)e£ 







{source, sink). The result is 



(9.5c) 



Now observe that if an edge connects two vertices in V, its flow appears as both a positive and a negative term in 9.5c. This means that the only 
positive terms that are not cancelled out are the flows into the sink. In addition, the only negative terms that remain are the flows out of the 
source. Therefore, 



2 /(v, sink) - Z /(source, v) = ■ 

(v,sink)e£ (source,v)e£ 

MAXIMAL FLOWS 

Since the Maximum Flow Problem consists of maximizing the amount of material that passes through a given network, it is equivalent to 
finding a flow with the largest possible value. Any such flow is called a maximal flow. 

For the network in Figure 9.5.6, one flow is fi , defined by /i(ei) = 25, /ife) = 20, /ife) = 0, fi(e^) = 25, and /i(e 5 ) = 20. The value of f\ , 
V(fi), is 45. Since the total flow into the sink can be no larger than 50 (w (e 4 ) + w {e s ) = 30 + 20), we can tell that f\ is not very far from the 
solution. Can you improve on f\ at all? The sum of the capacities into the sink can't always be obtained by a flow. The same is true for the sum 
of the capacities out of the source. In this case, the sum of the capacities out of the source is 60, which obviously can't be reached in this 
network. 

A solution of the Maximum Flow Problem for this network is the maximal flow f 2 , where f^ifii) = 25, fcigi) = 25, f2(e 3 ) = 5,f 2 (e4) = 30, and 
fi( e 5) - 20, with V(fi) = 50. This solution is not unique. In fact, there is an infinite number of maximal flows for this problem. 

There have been several algorithms developed to solve the Maximal Flow Problem. One of these is the Ford and Fulkerson Algorithm (FFA). 
The FFA consists of repeatedly finding paths in a network called flow augmenting paths until no improvement can be made in the flow that has 
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been obtained. 

Definition: Flow Augmenting Path. Given a flow f in a network (V, E), a flow augmenting path with respect to f is a simple path from 
the source to the sink using edges both in their forward and their reverse directions such that for each edge e in the path, w (e) - f (e) > 
if e is used in its forward direction and f (e) > if e is used in the reverse direction. 

Example 9.5.7. For f in Example 9.5 .6, a flow augmenting path would be (e 2 , £3 , £4 ) since 

wfe) - f\(ei) = 15, w(e 3 ) - f(e 3 ) = 5, and w(e 4 ) - f(e A ) = 5. 

These positive differences represent unused capacities , and the smallest value represents the amount of flow that can be added to each edge in 
the path. Note that by adding 5 to each edge in our path, we obtain /j, which is maximal. If an edge with a positive flow is used in its reverse 
direction, it is contributing a movement of material that is counterproductive to the objective of maximizing flow. This is why the algorithm 
directs us to decrease the flow through that edge. 

Algorithm 9.5.3: The Ford andFulkerson Algorithm. 

(1) Define the flow function f by f (e) = for each edge e e E. 

(2) i = 0. 

(3) Repeat: 

(3.1) If possible, find a flow augmenting path with respect to f. 

(3.2) If a flow augmenting path exists, then: 

(3.2.1) Determine 

d =min{{w(e) - f(e) \ e is used in the forward direction), 
{fi(e) I e is used in the reverse direction}} 

(3.2.2) Define f +l by 

f + \(e) = f,(e) if e is not part of the flow augmenting path 
f + i(e) = fj(e) + d if e is used in the forward direction 

f + i (e) = f{e) - d if e is used in the reverse direction 

(3.2.3) ( = /+ 1. 

until no flow augmenting path exists. 

(4) Terminate with a maximal flow f 
Notes: 

(a) It should be clear that every flow augmenting path leads to a flow of increased value and that none of the capacities of the network can be 
violated. 

(b) The depth-first search should be used to find flow augmenting paths since it is far more efficient than the breadth-first search in this 
situation. The depth-first search differs from the broadcasting algorithm (a variation of the breadth-first search) in that you sequentially visit 
vertices until you reach a "dead end" and then backtrack. 

(c) There have been networks discovered for which the FFA does not terminate in a finite number of steps . These examples all have irrational 
capacities. It has been proven that if all capacities are positive integers, the FFA terminates in a finite number of steps. See Ford and Fulkerson, 
Even, or Beige for details. 

(d) When you use the FFA to solve the Maximum Flow Problem by hand it is convenient to label each edge of the network with the fraction 
f(e)/w(e). 

A Depth-First Search for the Sink Initiating at the Source. Let E ' be the set of directed edges that can be used in producing a flow 
augmenting path. Add to the network a vertex called start and the edge (start, source). 

(1) S = vertex set of the network. 

(2) p = start. 

(3) p = source (*Move p along the edge (start, source) *) 

(4) While p is not equal to start or sink do. 

If an edge in E ' exists that takes you from p to another vertex in S 
then set p to be that next vertex and delete the edge from E '. 
else reassign p to be the vertex that p was reached from (i.e., backtrack) . 

(5) If p = start, 

then no flow augmenting path exists. 

else p = sink, you have found a flow augmenting path. 

Example 9.5.8. Consider the network in Figure 9.5.7, where the current flow, /, is indicated by a labeling of the edges. The path 
(Source, v 2 , V\, V3 , Sink) is a flow augmenting path that allows us to increase the flow by one unit. Note that (i»i, V3) is used in the reverse 
direction, which is allowed because /(vj , V3) > 0. The value of the new flow that we obtain is 8. This flow must be maximal since the capacities 
out of the source add up to 8. This maximal flow is defined by the labeling of Figure 9.5.8. 
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FIGURE 9.5.7 

Current flow in Example 9.5.8 




FIGURE 9.5.8 

Augmented, optimal flow in Example 9.5.8 

OTHER GRAPH-OPTIMIZATION PROBLEMS 

(a) The Minimum Spanning Tree Problem: Given a weighted graph, (V, E, w), find a subset E' of E with the properties that (V, E') is 
connected and the sum of the weights of edges in E' are as small as possible. We will discuss this problem in Chapter 10. 

(b) The Minimum Matching Problem: Given an undirected weighted graph, (KE, w), with an even number of vertices, pair up the vertices so 
that each pair is connected by an edge and the sum of these edges is as small as possible. A unit square version of this problem has been studied 
extensively. See References: [SopowitOOl] for details on what is known about this version of the problem. 

(c) The Graph Center Problem: Given a connected, undirected, weighted graph, find a vertex (the center) in the graph with the property that 
the distance from the center to every other vertex is as small as possible. "As small as possible" could be interpreted either as minimizing the 
sum of the distances to each vertex or as minimizing the maximum distance from the center to a vertex. 

EXERCISES FOR SECTION 9.5 
A Exercises 

1. Find the closest neighbor circuit through the six capitals of New England starting at Boston. If you start at a different city, will you get a 
different circuit? 

2. Is Theorem 9.5.1 sharp for n = 3? For n = 4? 

3. Given the following sets of points in the unit square, find the shortest circuit that visits all the points and find the circuit that is obtained with 
the strip algorithm. 

(a) {(0.1 /t, 0.1 k) : k = 0, 1, 2, ..., 10) 

(b) {(0.1, 0.3), (0.3, 0.8), (0.5, 0.3), (0.7, 0.9), (0.9, 0.1)) 

(c) {(0.0, 0.5), (0.5, 0.0), (0.5, 1.0), (1.0, 0.5)) 

(d) {(0, 0), (0.2, 0.6), (0.4, 0.1), (0.6, 0.8), (0.7, 0.5)) 

4. For n = 4, 5, and 6, locate n points in the unit square for which the strip algorithm works poorly. 

5. Consider the network whose maximum capacities are shown on the following graph. 
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Source 4 *■ b 



(a) A function /is partially defined on the edges of this network by: 

/(Source, c) = /(Source, b) =/ (Source, a) = 2, and /(a, d) = 1, 
Define /on the rest of the other edges so that /is a flow. What is the value of/ ? 

(b) Find a flow augmenting path with respect to/for this network. What is the value of the augmented flow? 

(c) Is the augmented flow a maximum flow? Explain. 

6. Given the following network with capacity function c and flow function/ find a maximal flow function. The labels on the edges of the 
network are of the form f (e)/c (e), where c (e) is the capacity of edge e and / (e) is the used capacity for flow/. 




7. Find maximal flows for the following networks. 
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(b) 



Source 3 — >- — b 




Sink 



(C) 




B Exercises 

8. (a) [Easy] Find two maximal flows for the network in Figure 9.5.6 other than the one found in the text, 
(b) [Harder] Describe the set of all maximal flows for the same network. 

(c) [Hardest] Prove that if a network has two maximal flows, then it has an infinite number of maximal flows. 

9. Discuss reasons that the closest neighbor algorithm is not used in the unit square version of the Traveling Salesman Problem. (Hint: Count 
the number of comparisons of distances that must be done.) 

C Exercises 

10. Explore the possibility of solving the Traveling Salesman Problem in the "unit box": [0, l] 3 . 

1 1 . Devise a "closest neighbor" algorithm for matching points in the unit square. 
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I 9.6 Planarity and Colorings 

The topics in this section are related to how graphs are drawn. 

Planarity: Can a given graph be drawn in a plane so that no edges intersect? Certainly, it is natural to avoid intersections, but up to now we 
haven't gone out of our way to do so. 

Colorings: Suppose that each vertex in an undirected graph is to be colored so that no two vertices that are connected by an edge have the same 
color. How many colors are needed? This question is motivated by the problem of drawing a map so that no two bordering countries are colored 
the same. A similar question can be asked for coloring edges. 

Definition: Planar Graphl Plane Graph/Planar Embedding. A graph is planar if it can be drawn in a plane so that no edges cross. If a 
graph is drawn so that no edges intersect, it is a plane graph, and such a drawing is a planar embedding of the graph. 

Example 9.6.1 . The graph in Figure 9.6.1(a) is planar but not a plane graph. The same graph is drawn as a plane graph in Figure 9.6.1(b) 




(b) 




Figure 9.6.1 

A planar graph and a planar embedding 



Notes: 



(a) In discussing planarity, we need only consider simple undirected graphs with no self-loops. All other graphs can be treated as such since all 
of the edges that relate any two vertices can be considered as one "package" that clearly can be drawn in a plane. 

(b) Can you think of a graph that is not planar? How would you prove that it isn't planar? Proving the nonexistence of something is usually 
more difficult than proving its existence. This case is no exception. Intuitively, we would expect that sparse graphs would be planar and dense 
graphs would be nonplanar. Theorem 9.6.2 will verify that dense graphs are indeed nonplanar. 

(c) The topic of planarity is a result of trying to restrict a graph to two dimensions. Is there an analogous topic for three dimensions? What 
graphs can be drawn in one dimension? 

Answer to Note c: If a graph has only a finite number of vertices, it can always be drawn in three dimensions. This is not true for all graphs with 
an infinite number of vertices. The only "one-dimensional" graphs are the ones that consist of a finite number of chains, as in Figure 9.6.2, with 
one or more vertices in each chain. 







Figure 9.6.2 

Chains of length one, two and three 



Example 9.6.2. A discussion of planarity is not complete without mentioning the famous Three Utilities Puzzle. The object of the puzzle is to 
supply three houses, A, B, and C, with the three utilities, gas, electric, and water. The constraint that makes this puzzle impossible to solve is 
that no utility lines may intersect i. e., a planar embedding of the graph in Figure 9.6.3, which is commonly denoted K^j. This graph is one of 
two fundamental nonplanar graphs. The Kuratowski Reduction Theorem states that if a graph is nonplanar then "contains" either a Kt, 3 or a K s . 
Containment is in the sense that if you start with a nonplanar graph you can always perform a sequence of edge deletions and contractions 
(shrinking an edge so that the two vertices connecting it coincide) to produce one of the two graphs . 
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Figure 9.6.3 

The Three Utilities Puzzle. 

A planar graph divides the plane into one or more regions . Two points on the plane lie in the same region if you can draw a curve connecting 
the two points that does not pass through an edge. One of these regions will be of infinite area. Each point on the plane is either a vertex, a point 
on an edge, or a point in a region. A remarkable fact about the geography of planar graphs is the following theorem that is attributed to Euler. 

Theorem 9.6.1: Euler's Formula. If G = (V, E) is a connected planar graph with r regions, v vertices and e edges, then 
v + r - e = 2 (9.6a) 

Experiment: Jot down a graph right now and count the number of vertices, regions, and edges that you have. If v + r — e is not 2, then your 
graph is either nonplanar or not connected. 

Proof: We prove Euler's Formula by Induction on e, for e > 0. 

Basis: If e = 0, then G must be a graph with one vertex, v = 1; and there is one infinite region, r = 1. 

Therefore, v + r - e = 1 + 1-0 = 2, and the basis is true. 

Induction: Suppose that G has k edges, k > 1, and that all connected planar graphs with less than k edges satisfy 9.6a. Select any edge that is 
part of the boundary of the infinite region and call it e\ . Let G ' be the graph obtained from G by deleting e\ . Figure 9.6.4 illustrates the two 
different possibilities we need to consider: either G' is connected or it has two connected components, G\ and G 2 . 



Case 1: 




Case 2: 




Figure 9.6.4 

Two case in the proof of Euler's Formula 

If G' is connected, the induction hypothesis can be applied to it. If G' has v' vertices, r' 
terms of the corresponding numbers for G, 

v ' = v No vertices were removed to form G ' 

r' = r - 1 One region of G merged with the infinite region when e\ is removed 

e ' = k - 1 We assumed that G had k edges . 

For the case where G ' is connected, 



edges and e' edges, then 



; 2 and in 



v + r — e = v + r - k 

= v' + (r'+ 1) -(e' + 1) 
= v ' + r ' - e ' 

= 2 



If G ' is not connected, it must consist of two connected components, Gi and G 2 since we started with a connected graph, G. We can apply 
the induction hypothesis to each of the two components to complete the proof. We leave it to the students to do this , with the reminder that 
in counting regions, G\ and G 2 will share the same infinite region. ■ 

Theorem 9.6.2. If G = (V, E) is a connected planar graph with v vertices, v > 3, and e edges, then 



e < 3v 



(9.6b) 
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Remark: One implication of 9.6b is that the number of edges in a connected planar graph will never be larger than three times its 
number .of vertices (as long as it has at least three vertices). Since the maximum number of edges in a graph with v vertices is a quadratic 
function of v, as v increases , planar graphs are more and more sparse. 

Outline of a Proof of Theorem 9.6.2. 

(a) Let r be the number of regions in G. For each region, count the number of edges that comprise its border. The sum of these counts must 
be at least 3r. Recall that we are working with simple graphs here, so a region made by two edges connecting the same two vertices is not 
possible. 

(b) Based on (a), infer that the number of edges in G must be at least — . 

(c) e > — => r < — 

(d) Substitute — for r in Euler's Formula to obtain an inequality that is equivalent to 9.6 .b. ■ 
The following theorem will be useful as we turn to graph coloring. 

Theorem 9.6.3. If G is a connected planar graph, then it has a vertex with degree 5 or less. 

Proof (by contradiction): We can assume that G has at least seven vertices, for otherwise the degree of any vertex is at most 5. Suppose 
that G is a connected planar graph and each vertex has a degree of 6 or more. Then, since each edge contributes to the degree of two vertices, 

e > — = 3 v. However, Theorem 9.6.2 states that the e < 3 v - 6 < 3 v, which is a contradiction. ■ 

2 

GRAPH COLORING 




Figure 9.6.5 

A 3-coloring of Euler Island 

The map of Euler Island in Figure 9.6.5 shows that there are seven towns on the island. Suppose that a cartographer must produce a colored 
map in which no two towns that share a boundary have the same color. To keep costs down, she wants to minimize the number of different 
colors that appear on the map. How many colors are sufficient? For Euler Island, the answer is three. This problem motivates a more general 
problem. 

The Graph Coloring Problem. Given an undirected graph G = (V, E), find a "coloring function" /from V into a set of colors H such that 

(Vj, vj] e£ => /(v,) =£ /(v/J and H has the smallest possible cardinality. The cardinality of// is called the chromatic number of G, x (G). 

Notes: 

(a) A coloring function onto an n element set is called an ^-coloring. 

(b) In terms of this general problem, the chromatic number of the graph of Euler Island is three. To see that no more than three colors are 
needed, we need only display a 3-coloring: /(l) = /(4) = /(6) = blue, /(2) = red, and /(3) = /(5) = /(7) = white. This coloring is not 
unique. The next smallest set of colors would be of two colors, and you should be able to convince yourself that no 2-coloring exists for this 
graph. 

In the mid-nineteenth century, it became clear that the typical planar graph had a chromatic number of no more than 4. At that point, mathemati- 
cians attacked the Four-Color Conjecture, which is that if G is any planar graph, then its chromatic number is no more than 4. Although the 
conjecture is quite easy to state, it took over 100 years, until 1976, to prove the conjecture in the affirmative. 

Theorem 9.6.4: The Four-Color Theorem. If G is a planar graph, then ^(G) < 4. 
A proof of the Four-Color Theorem is beyond the scope of this text, but we can prove a theorem that is only 25 percent inferior. 

Theorem 9.6.5: The Five-Color Theorem. If G is a planar graph, then Jf{G) < 5. 

The number 5 is not a sharp upper bound for x(G) because of the Four-Color Theorem. 

Proof, by Induction on the Number of Vertices in the Graph: 

Basis: Clearly, a graph with one vertex has a chromatic number of 1 . 

Induction: Assume that all planar graphs with n - 1 vertices have a chromatic number of 5 or less. Let G be a planar graph. By Theorem 9.6.2, 
there exists a vertex v with deg v < 5. Let G - v be the planar graph obtained by deleting v and all edges that connect v to other vertices in G. 
By the induction hypothesis, G - v has a 5-coloring. Assume that the colors used are red, white, blue, green, and yellow. 
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If degv < 5, then we can produce a 5-coloring of G by selecting a color that is not used in coloring the vertices that are connected to v with an 
edge in G. 

If degv = 5, then we can use the same approach if the five vertices that are adjacent to v are not all colored differently. We are now left with 
the possibility that Vi, V2, V3, V4, and v 5 are all connected to v by an edge and they are all colored differently. Assume that they are colored red, 
white blue, yellow, and green, respectively, as in Figure 9.6.6. 



Red 



White 



iGreen 




Blue 



Figure 9.6.6 

Starting at Vi in G - v, suppose we try to construct a path V3 that passes through only red and blue vertices. This can either be accomplished or 
it can't be accomplished. If it can't be done, consider all paths that start at V] , and go through only red and blue vertices. If we exchange the 
colors of the vertices in these paths, including V] we still have a 5-coloring of G - v. Since V\ is now blue, we can color the central vertex, v, 
red. 

Finally, suppose that Vj is connected to V3 using only red and blue vertices. Then a path from Vi to V3 by using red and blue vertices followed by 
the edges (V3, v) and (v, V]) completes a circuit that either encloses V2 or encloses V4 and v 5 . Therefore, no path from Vi to V4 exists using only 
white and yellow vertices. We can then repeat the same process as in the previous paragraph with V2 and V4 , which will allow us to color v 
white. ■ 

Definition: Bipartite Graph. A bipartite graph is a graph that has a 2-coloring. Equivalently, a graph is bipartite if its vertices can be 
partitioned into two nonempty subsets so that no edge connects a vertices from the same from each subset. 

Example 9.6.3. 

(a) The graph of the Three Utilities Puzzle is bipartite. The vertices are partitioned into the utilities and the homes. Of course a 2-coloring of 
the graph is to color the utilities red and the homes blue. 

(b) For n > 1, the rc-cube is bipartite. A coloring would be to color all strings with an even number of l's red and the strings with an odd 
number of l's blue. By the definition of the rc-cube, two strings that have the same color couldn't be connected since they would need to differ 
in at least two positions. 

(c) Let Vbe a set of 64 vertices, one for each square on a chess board. We can index the elements of Vby 

Vjj = the square on the row i, column j. 
Connect vertices in V according to whether or not you can move a knight from one square to another. Using our indexing of V, 

\i-k\ + \j-I\ = 3 
and I i - k \ ■ \ j - 1 | = 2 
(V, E) is a bipartite graph. The usual coloring of a chessboard is valid 2-coloring. 

How can you recognize whether a graph is bipartite? Unlike planarity, there is a nice equivalent condition for a graph to be bipartite. 
Theorem 9.6.6. An undirected graph is bipartite if and only if it has no circuit of odd length. 

Proof. (=>) Let G = (V, E) be a bipartite graph that is partitioned into two sets, R(ed) and B(lue) that define a 2-coloring. Consider any circuit 
in V. If we specify a direction in the circuit and define /on the vertices of the circuit by 

f(u) = the next vertex in the circuit after v. 

Note that /is abijection. Hence the number of red vertices in the circuit equals the number of blue vertices, and so the length of the circuit must 
be even. 

(<=) Assume that G has no circuit of odd length. For each component of G, select any vertex w and color it red. Then for every other vertex v 
in the component, find the path of shortest distance from w to v. If the length of the path is odd, color v blue, and if it is even, color v red. We 
claim that this method defines a 2-coloring of G. Suppose that it does not define a 2-coloring. Then let v a and v b be two vertices with identical 



(yij, v k iJg E if and only if 
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colors that are connected with an edge. By the way that we colored G, neither v fl nor v b could equal w. We can now construct a circuit with an 
odd length in G. First, we start at w and follow the shortest path to v„ . Then follow the edge (v 0! v b ), and finally, follow the reverse of a 
shortest path from w to v b . Since v„ and v b have the same color, the first and third segments of this circuit have lengths that are both odd or 
even, and the sum of their lengths must be even. The addition of the single edge (y a , v b ) shows us that this circuit has an odd length. This 
contradicts our premise. ■ 

Note: An efficient algorithm for finding a 2-coloring of a graph can be designed using the method that is used in the second part of the 
proof above. 

EXERCISES FOR SECTION 9.6 
A Exercises 

1 . Apply Theorem 9.6.2 to prove that once n gets to a certain size, a K n is nonplanar. What is the largest complete planar graph? 

2. Can you apply Theorem 9.6.2 to prove that the Three Utilities Puzzle can't be solved? 

3. What are the chromatic numbers of the following graphs? 

(a) (b) 

3 




(c) 



(d) 




(e) 



(f) 




4. Prove that if an undirected graph has a subgraph that is a K 3 it then its chromatic number is at least 3. 

5. What is x(K„),n> 1? 

6. What is the chromatic number of the United States? 

B Exercises 

7 . Complete the proof of Theorem 9 .6 . 1 . 

8. Use the outline of a proof of Theorem 9.6.2 to write a complete proof. Be sure to point out where the premise v > 3 is essential. 

9. Let G = (V, E) with | V | > 11, and let U be the set of all undirected edges between distinct vertices in V. Prove that either G or 
G' = (V, E c ) is nonplanar. 

10. Design an algorithm to determine whether a graph is bipartite. 

1 1 . Prove that a bipartite graph with an odd number of vertices greater than or equal to 3 has no Hamiltonian circuit. 

C Exercises 

12. Prove that any graph with a finite number of vertices can be drawn in three dimensions so that no edges intersect. 

13. Suppose you had to color the edges of an undirected graph so that for each vertex, the edges that it is connected to have different colors. 
How can this problem be transformed into a vertex coloring problem? 
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14. (a) Suppose the edges of a K§ are colored either red or blue. Prove that there will be either a "red ^3" (a subset of the vertex set with three 
vertices connected by red edges) or a "blue ^3." 

(b) Suppose six people are selected at random. Prove that either there exists a subset of three of them with the property that any two people in 
the subset can communicate in a common language, or there exist three people, no two of whom can communicate in a common language. 
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| SUPPLEMENTARY EXERCISES FOR CHAPTER 9 

Section 9.1 

1 . Determine which two of the graphs below are isomorphic and give an explicit isomorphism between those two graphs. 

G 3 






\ / 

k 

2. Teams 1, 2, 3, and 4 compete in a round-robin tournament. Draw a round-robin tournament graph that represents one of the possible out 
comes of the tournament. In terms of your graph, what was the outcome of the tournament (in wins and losses)? 

3. Let G = (V, E) be an undirected graph. An independent set, W, is a subset of V having the property that no two vertices in W are connected 
by an edge in E. That is , 

v, w e W => {v, w) $ E. 
Finding a maximal independent set (an independent set that is as large as possible) is often of interest. 

(a) Find a maximal independent set in the graph of Figure 9.1 .2. 

(b) Prove that if W is a maximal independent set in G, then every vertex in G is connected by an edge to at least one vertex in W. 

(c) How large can a maximal independent set in a K„ be? 

4. Let S = { 1 , 2, . . . , n) be a set of n cities. Define a matrix A = [a, ; l of order nxnby a, ; - = if i = j; otherwise ay is the number of distinct ways 
of traveling directly from city i to city j by car, without visiting any other cities belonging to S en route. 

In the following diagram, the points represent five cities, 1, 2, 3, 4, and 5, and a line is drawn between two cities and labeled with a positive 
integer giving the number of direct routes (by car) between the respective cities . 




(a) Determine the 5x5 matrix A going with the above diagram. 

(b) Calculate A 2 . 

(c) Interpret the meaning of the entries in A- , both for the result in part b, and for the general set 5 = { 1 , 2, . . . , n) 
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5 . An undirected graph can be used to model a map of states or countries , where there is a vertex for each country and an edge between two 
vertices if the two countries share a boundary. For example, the undirected graph of Central America (mainland only) would have eight 
vertices. {Honduras, Nicaragua} would be an edge of the graph, but {Mexico, Nicaragua} would not be an edge. 



MEXICO 








i 



Caribbean Sea 



GUATEMALA 



>»'-i 



HONDURAS 



SALVADOR > -P^ 



NICARAGUA 



PucMcOcean 




<* 



(a) Draw the graph of Central America. 

(b) Find a path from Mexico to Panama. 

(c) What significance would this path have if you intended to drive from Mexico to Panama? 

6. Show that the graph of the states of Washington, Oregon, Idaho, Montana, Wyoming, and Colorado is isomorphic to the graph of New 
England. 

7. The outdegree of a vertex in a directed graph is the number of vertices in the graph that start at that vertex, and the indegree of the vertex is 
the number of vertices in the graph that terminate at that vertex. A vertex v in a tournament graph is a source if the indegree of that vertex is 
zero. A sink is a vertex in a tournament graph that has outdegree of zero. 

(a) Prove that a round-robin tournament graph can have at most one source and at most one sink. Interpret these facts in terms of the 
results of the tournament. 

(b) What is the outdegree of a source in a round-robin tournament graph? What is the indegree of a sink in a round-robin tournament 
graph? 

(c) Let G = (V, E) be a round-robin tournament graph with | V\ > 2. If p is "G has a sink," and q is "G has a source," prove that any one of 
the propositions -i p f\ -i q, -i p /\q, p /\ -i q and p f\q could be true. 

Section 9.2 

8. Let G = (V, E)be a round-robin tournament graph with \V\ = n. If Mis the matrix of G, 

(a) For i = 1, 2, ..., n, explain why the number of l's in column i of M plus the number of Is in row i is always equal to n - 1. 

(b) How many Is are there in M? 

Section 9.3 

9. Use the broadcasting algorithm to determine a shortest path from vertex a to vertex i in the following graph. List the depth sets. 
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10. In a breadth-first (broadcasting) search for a path from vertex A to vertex J, what would the depth sets (D t , D 2 , ...) be? 




Section 9.4 

Definition: Randomly Eulerian. Graph G is randomly Eulerian from vertex v if every path in G that initiates at v and uses edges at random is a 
Eulerian circuit. 

11. Give examples of undirected graphs that are randomly Eulerian from none, one, two, or all vertices. It can be proven that no graph is 
randomly Eulerian from more than two vertices unless it is randomly Eulerian from all vertices. 

12. Prove that G is randomly Eulerian from v if and only if every circuit in G contains v. 

13. Ore's Theorem states that if G = (V, E) is an undirected graph with | V\ = n s 3 such that (v, w) £ E => deg u + deg vsn, then G has a 
Hamiltonian circuit. Prove Ore's Theorem given the following out line. 

Proof by contradiction: 

(a) Add edges to E so that G still has no Hamiltonian circuit, but so that the addition of any other edge does produce a Hamiltonian 
circuit. Now select any {v, w] £ E. There must be a path in G: 

V = (Vi, V2, .... v„) = w. 

(b) Prove that for 2 ^ i ^ n, \v\ , v,( e£=> {v;_i , v„} £E. 

(c) Conclude that deg v + deg w < n. 

Section 9.5 

14. Given: 




(a) Determine the optimal path from the vertex a to the vertex z, 

(b) Use the broadcasting algorithm to find a path from a to z, 

(c) Find the "costs" of both paths and discuss whether the algorithm in part b gives a "good result.' 
15 . Consider the network 
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We start with a flow/on this network partially defined by f(s, c) = f(s, b) = f(s, a) = 1 and/(d, a) = 0. 

(a) Define /for each of the other edges in this network so as to achieve a flow. 

(b) Assume that the maximum capacity of each edge is 2. Determine a flow augmenting path with respect to the flow given in part a. 
(Use the Ford and Fulkerson Algorithm.) 

(c) Is the flow obtained in part b a maximal flow? Explain. 
16. Find a maximal flow for the following network: 




17. (a) Given the mileage chart below, what is the closest neighbor circuit starting at city A? 

A B C D E F 

A _ 34 50 12 51 37 

B 34 _ 40 27 70 60 

C 50 40 _ 30 33 41 

D 12 27 30 _ 39 20 

E 51 70 33 39 _ 15 

F 37 60 41 20 15 

(b) Suppose it takes (k - 1) seconds to determine which of k cities is closest to any given city. Estimate how long it would take to find a 
closest neighbor circuit through n cities. 

Section 9.6 

18. Draw three connected graphs, Go, G3, and G4, with four vertices, each having chromatic numbers 2, 3, and4 (i.e.c(G,) = i. 
19. (a) Prove that although a K 5 is not planar, it can be drawn on a torus (a doughnut-shaped surface). 

(b) Can the Three Utilities Puzzle be solved on a torus? 

20. Draw examples of the following graphs, if possible. Clearly indicate which of the graphs are impossible and why they are impossible. You 
may cite any theorem that justifies your claim. 

(a) An undirected Hamiltonian graph that is not Eulerian. Explain why your graph is not Eulerian. 

(b) A bipartite Hamiltonian graph with five vertices. 

(c) A round-robin tournament graph with four vertices . 

(d) A connected planar graph with four vertices , three regions , and six edges . 

(e) An undirected graph with chromatic number five. 
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GOALS 

In this chapter we will study the class of graphs called trees. Trees are frequently used in both mathematics and the sciences. Our solution of 
Example 2.1 is one simple instance. Since they are often used to illustrate or prove other concepts, a poor background in trees can be a serious 
handicap. For this reason, our ultimate goals are to: (1) define the various common types of trees, (2) identify some basic properties of trees, 
and (3) discuss some of the common applications of trees. 
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| 10.1 What Is a Tree? 



What distinguishes all trees from other types of graphs is the absence of certain paths called cycles . 

Definition: Cycle. A cycle is a circuit whose edge list contains no duplicates. 

The simplest example of a cycle in an undirected graph is a pair of vertices with two edges connecting them. Since trees are cycle-free, we can 
rule out all multigraphs from consideration as trees . 

Trees can either be undirected or directed graphs. We will concentrate on the undirected variety in this chapter. 

Definition: Tree. An undirected graph is a tree if it is connected and contains no cycles or self-loops. 
Example 10.1.1. 
(a) Graphs i, ii and iii in Figure 10.1 .1 are all trees, while graphs iv, v, and vi are not trees. 

(i) (iii) 

(ii) 




(iv) 



(vi) 






Figure 10.1.1 

Some trees and non-trees 

(b) A Ki is a tree. However, if n > 3, a K„ is not a tree. 

(c) In a loose sense, a botanical tree is a mathematical tree. There are no cycles in the branch structure of a botanical tree. 

(d) The structures of some chemical compounds are modeled by a tree. For example, butane (Figure 10.1 .2a) consists of four carbon molecules 
and ten hydrogen molecules, where an edge between two molecules represents a bond between them. A bond is a force that keeps two 
molecules together. The same set of molecules can be linked together in a different tree structure to give us the compound isobutane (Figure 
10.1.2b). There are some compounds whose graphs are not trees. One example is benzene (Figure 10.1.2c). 

(c) 





(b) 




(a) 
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H 

/ \ 

H 
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Figure 10.1.2 

Simple organic compounds 

One type of graph that is not a tree, but is closely related, is a forest. 

Definition: Forest. A forest is an undirected graph whose components are all trees. 
Example 10.1.2. The top half of Figure 10.1 .1 can be viewed as a forest of three trees. 
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We will now examine several conditions that are equivalent to the one that defines a tree. The following theorem will be used as a tool in 
proving that the conditions are equivalent. 

Theorem 10.1.1. Let G = (V, E) be an undirected graph with no self-loops, and let v a , v b e V. If two different simple paths exist between 
v a and v h , then there exists a cycle in G. 

Proof: Let p\ = (e iy e 2 , ■■■, e m ) and p 2 = (f\, f 2 , ■■■, /«) be two different simple paths from v a to v b . The first step we will take is to 
delete from p i and p 2 the initial edges that are identical . That is , if e [ = fi,e 2 = f 2 , . . . , ej = fj, and ej+ [ =/= f + 1 delete the first j edges of both 
paths. Once this is done, both paths start at the same vertex, call it v c , and both still end at v b . Now we construct a cycle by starting at v c and 
following what is left of p\ until we first meet what is left of p 2 . If this first meeting occurs at vertex v d , then the remainder of the cycle is 
completed by following the portion of the reverse of p 2 that starts at v d and ends at v c . ■ 

Theorem 10.1.2. Let G = (V, E) be an undirected graph with no self-loops and | V | = n. The following are all equivalent: 

(1) G is a tree. 

(2) For each pair of distinct vertices in V, there exists a unique simple path between them. 

(3) G is connected, and if e e E, then (V, E - {e\) is disconnected. 

(4) G contains no cycles, but by adding one edge, you create a cycle. 

(5) G is connected and \E \ = n - 1 . 

Proof Strategy. Most of this theorem can be proven by proving the following chain of implications: (1) => (2), (2) => (3), (3) => (4), and 
(4) => (1). Once these implications have been demonstrated, the transitive closure of = A > on {1, 2, 3, 4} establishes the equivalence of the 
first four conditions. The proof that Statement 5 is equivalent to the first four can be done by induction, which we will leave to the reader. 

Proof: (1) => (2) (Indirect). Assume that G is a tree and that there exists a pair of vertices between which there is either no path or there are at 
least two distinct paths. Both of these possibilities contradict the premise that G is a tree. If no path exists, G is disconnected, and if two paths 
exist, a cycle can be obtained by Theorem 10.1.1. 

(2) => (3). We now use Statement 2 as a premise. Since each pair of vertices in V are connected by exactly one path, G is connected. Now if we 
select any edge e in E, it connects two vertices, vi and v 2 . By (2), there is no simple path connecting vi to v 2 other than e. Therefore, no path at 
all can exist between Vi and v 2 in (V, E - {e}). Hence (V, E - {e}) is disconnected. 

(3) => (4). Now we will assume that Statement 3 is true. We must show that G has no cycles and that adding an edge to G creates a cycle. We 
will use an indirect proof for this part. Since (4) is a conjunction, by DeMorgan's Law its negation is a disjunction and we must consider two 
cases. First, suppose that G has a cycle. Then the deletion of any edge in the cycle keeps the graph connected, which contradicts (3). The second 
case is that the addition of an edge to G does not create a cycle. Then there are two distinct paths between the vertices that the new edge 
connects. By Theorem 10.1 .1 , a cycle can then be created, which is a contradiction. 

(4) => (1) Assume that G contains no cycles and that the addition of an edge creates a cycle. All that we need to prove to verify that G is a tree 
is that G is connected. If it is not connected, then select any two vertices that are not connected. If we add an edge to connect them, the fact that 
a cycle is created implies that a second path between the two vertices can be found which is in the original graph, which is a contradiction. ■ 

The usual definition of a directed tree is based on whether the associated undirected graph, which is obtained by "erasing" its directional arrows, 
is a tree. In Section 10.3 we will introduce the rooted tree, which is a special type of directed tree. 

EXERCISES FOR SECTION 10.1 
A Exercises 

1 . Given the following vertex sets, draw all possible undirected trees that connect them. 

(a) V a = {right, left) 

(b) V„ = { + , -, 0) 

(c) V c = (north, south, east, west). 

2. Are all trees planar? If they are, can you explain why? If they are not, you should be able to find a nonplanar tree. 

3. Prove that if G is a simple undirected graph with no self-loops, then G is a tree if and only if G is connected and | E | = I V I - 1 . 

4. (a) Prove that if G = (V, E) is a tree and e e E, then (V, E - \e\) is a forest of two trees. 

(b) Prove that if(Vi,2?i) and (V 2 , E 2 ) are disjoint trees and e is an edge that connects a vertex in V\ to a vertex in V 2 , then 
(V1UV2, Ey U E 2 U W) is a tree. 

5 . (a) Prove that any tree with at least two vertices has at least two vertices of degree 1 . 



(b) Prove that if a tree is not a chain, then it has at least three vertices of degree 1 . 

I 10.2. Spanning Trees 

The topic of spanning trees is motivated by a graph-optimization problem. 
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Example 10.2.1. A map of Atlantis (Figure 10.2.1) shows that there are four campuses in its university system. A new secure communications 
system is being installed and the objective is to allow for communication between any two campuses, to achieve this objective, the university 
must buy direct lines between certain pairs of campuses. Let G be the graph with a vertex for each campus and an edge for each direct line. 
Total communication is equivalent to G being a connected graph. This is due to the fact that two campuses can communicate over any number 
of lines. To minimize costs, the university wants to buy a minimum number of lines. 

North 




West 

East 



South 

Figure 10.2.1 

The Atlantis University System 

The solutions to this problem are all trees. Any graph that satisfies the requirements of the university must be connected, and if a cycle does 
exist, any line in the cycle can be deleted, reducing the cost. Each of the sixteen trees that can be drawn to connect the vertices North, South, 
East, and West (see Exercise lc of Section 10.1) solves the problem as it is stated. Note that in each case, three direct lines must be purchased. 
There are two considerations that can help reduce the number of solutions that would be considered. 

Objective 1: Given that the cost of each line depends on certain factors, such as the distance between the campuses, select a tree whose cost is 
as low as possible. 

Objective 2: Suppose that communication over multiple lines is noisier as the number of lines increases. Select a tree with the property that the 
maximum number of lines that any pair of campuses must use to communicate with is as small as possible. 

Typically, these objectives are not compatible; that is, you cannot always simultaneously achieve these objectives. In the case of the Atlantis 
university system, the solution with respect to Objective 1 is indicated with solid lines in Figure 10.2.1. There are four solutions to the problem 
with respect to Objective 2: any tree in which one campus is directly connected to the other three. One solution with respect to Objective 2 is 
indicated with dotted lines in Figure 10.2.1 . After satisfying the conditions of Objective 2, it would seem reasonable to select the cheapest of the 
four trees . 

Definition: Spanning Set, Spanning Tree. 

(a) Let G = (V, E) be a connected undirected graph. A spanning set for G is a subset E' of E such that (V, E') is connected. 

(b) IfE' is a spanning set for G and T = (V, E') is a tree, then T is called a spanning tree for G. 

Notes 

(a) If(V, E') is a spanning tree, \E'\ =\V\ - 1. 

(b) The significance of a spanning tree is that it is a minimal spanning 

set. A smaller set would not span the graph, while a larger set would have a cycle, which has an edge that is superfluous. 

For the remainder of this section, we will discuss two of the many topics that relate to spanning trees. The first is the Minimal Spanning Tree 
Problem, which addresses Objective 1 above. The second is the Minimum Diameter Spanning Tree Problem, which addresses Objective 2. 

THE MINIMAL SPANNING TREE PROBLEM 

Given a weighted connected undirected graph G = (V, E, w) , the minimal spanning tree problem is to find a spanning tree {V , E') for which 
£ w(e) is as small as possible. 

te£' 

Unlike many of the graph-optimization problems that we've examined, a solution to this problem can be obtained efficiently. It is a situation in 
which a greedy algorithm works. 

Definition: Bridge. Let G = (V , E) be an undirected graph and let [L, R) be a partition ofV. A bridge between L and R is an edge in E 
that connects a vertex in Lto a vertex in R. 

Theorem 10.2.1. Let G = (V, E, w) be a weighted connected undirected graph. Let V be partitioned into two sets L and R. If e' is a 
bridge of least weight between L and R, then there exists a minimal spanning tree for G that includes e* . 

Proof (Indirect proof): Suppose that no minimum spanning tree including e* exists. Let T = (V, E') be a minimum spanning tree. If we 
add e* to T, a cycle is created, and this cycle must contain another bridge, e, between L and R. Since w(e") < w(e), we can delete e and the 
new tree, which includes e* must also be a minimum spanning tree. ■ 
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Example 10.2.2. The bridges between the vertex sets {a, b, c) and \d, e) in Figure 10.2.2 are the edges {b, d) and {c, e). According to 
the theorem, a minimal spanning tree that includes {b, d] exists. By examination, you should be able to see that this is true. Is it true that only 
the bridges of minimal weight can be part of a minimal spanning tree? The answer is no. 




Figure 10.2.2 

Theorem 10.2.1 essentially tells us that a minimal spanning tree can be constructed recursively by continually adding minimally weighted 
bridges to a set of edges . 

Algorithm 10.2.1: Greedy Algorithm for Finding a Minimal Spanning Tree. Let G = (V, E, w) be a connected, weighted, undirected 
graph, and let v be an arbitrary vertex in V. The following steps lead to a minimal spanning tree for G. L and R will be sets of vertices and 
E' is a set of edges. 

(1) (Initialize) L = V - \v ); R = \v ); E = 0. 

(2) (Build the tree) While L t 

(2.1) Find e* = {v L , v R ), a bridge of minimum weight between L and R. 

(2.2) R = R U (v L ); L = L - \v L ) ; E' : = E'\J {e*} 
(3) Terminate with a minimal spanning tree (V, E'). 

Notes: 

(a) If more than one minimal spanning tree exists, then the one that is obtained depends on vo and the means by which e* is selected in Step 
2.1 if two minimally weighted bridges exist. 

(b) Warning: If two minimally weighted bridges exist between L and R, do not try to speed up the algorithm by adding both of them to E . 

(c) That Algorithm 10.2.1 yields a minimal spanning tree can be proven by induction with the use of Theorem 10.2.1 . 

(d) If it is not known whether G is connected, Algorithm 10.2.1 can be revised to handle this possibility. The key change (in Step 2.1) would 
be to determine whether any bridge at all exists between L and R. The condition of the while loop in Step 2 must also be changed somewhat. 

Example 10.2.3. Consider the graph in Figure 10.2.3. If we apply Algorithm 10.2.1 starting at a, we obtain the following edge list in the 
order given: {a, /}, {/, e), {e, c), (c, d], (/, b], {b, g\. The total of the weights of these edges is 20. The method that we have used (in 
Step 2.1) to select a bridge when more than one minimally weighted bridge exists is to order all bridges alphabetically by the vertex in L and 
then, if further ties exist, by the vertex in R. The first vertex in that order is selected in Step 2.1 of the algorithm. 




Figure 10.2.3 
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The Minimum Diameter Spanning Tree Problem 

Given a connected undirected graph G = (V, E), find a spanning tree T = (V, E') of G such that the longest path in T is as short as possible. 

Example 10.2.4. The Minimum Diameter Spanning Tree Problem is easy to solve in a K„. Select any vertex v u and construct the spanning tree 
whose edge set is the set of edges that connect v to the other vertices in the K„ . Figure 10.2.4 illustrates a solution for n = 5 . 




Minimum diameter spanning tree for K 5 

For incomplete graphs, a two-stage algorithm is needed. In short, the first step is to locate a "center" of the graph. The maximum distance from 
a center to any other vertex is as small as possible. Once a center is located, a breadth-first search of the graph is used to construct the spanning 
tree. The breadth-first search is essentially the broadcasting algorithm that we discussed in Section 9.3 applied to undirected graphs. 

EXERCISES FOR SECTION 10.2 
A Exercises 

1. Suppose that after Atlantis University's phone system is in place, a fifth campus is established and that a transmission line can be bought to 
connect the new campus to any old campus. Is this larger system the most economical one possible with respect to Objective 1? Can you always 
satisfy Objective 2? 

2. Construct a minimal spanning tree for the capital cities in New England (see Figure 9.5.1). 

3. Show that the answer to the question posed in Example 10.2.2 is "no." 

4. Find a minimal spanning tree for the following graphs. 

(a) 
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(b) 



BOS 




ATL 




5 . Find a minimum diameter spanning tree for the following graphs . 



(a) 




(b) 



21_ 



-17 



16 15 14- 



i r 4 - 
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I 10.3 Rooted Trees 



In the next two sections, we will discuss rooted trees. Our primary focuses will be on general rooted trees and on a special case, ordered binary 
trees. 

Informal Definition and Terminology: What differentiates rooted trees from undirected trees is that a rooted tree contains a distinguished 
vertex, called the root. Consider the tree in Figure 10.3.1 . Vertex A has been designated the root of the tree. If we choose any other vertex in the 
tree, such as M, we know that there is a unique path from A to M. The vertices on this path, (A, D, K, M), are described in genealogical terms: 

M is a child of K (so is L). 
K'k M's parent. 
A, D, and K are M's ancestors. 
D, K, and M are descendants of A. 




FIGURE 10.3.1 

Rooted tree 

These genealogical relationships are often easier to visualize if the tree is rewritten so that children are positioned below their parents , as in 
Figure 10.3.2. 

With this format, it is easy to see that each vertex in the tree can be thought of as the root of a tree that contains, in addition to itself, all of its 
descendants. For example, D is the root of a tree that contains D, K, L, and M. Furthermore, K is the root of a tree that contains K, L, and M. 
Finally, L and M are roots of trees that contain only themselves. From this observation, we can give a formal definition of a rooted tree. 

Root ► . 




FIGURE 10.3.2 

Rooted tree of Figure 10.3.1 , redrawn 



Definition: Rooted Tree. 

(a) Basis: 
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(i) A tree with no vertices is a rooted tree (the empty tree), 
(ii) A single vertex with no children is a rooted tree 

(b) Recursion: 

Let T\, T 2 , ... , T r , r > 1 , be disjoint rooted trees with roots v t , v 2 , ..., v r , respectively, and let v be a vertex that does not 
belong to any of these trees. Then a rooted tree, rooted at v , is obtained by making v the parent of the vertices V\, v 2 , ..., and 
v r . We call T\, T 2 , ... , T r , subtrees of the larger tree. 

The level of a vertex of a rooted tree is the number of edges that separate the vertex from the root. The level of the root is zero. The depth of a 
tree is the maximum level of the vertices in the tree. The depth of a tree in Figure 10.3.2 is three, which is the level of the vertices L and M. The 
vertices E, F, G, H, I, J, and K have level two. B, C, and D are at level one and A has level zero. 

Example 10.3.1. Figure 2.1.1, which we reproduce below is a rooted tree with Start as the root. It is an example of what is called a decision tree. 




Figure 2.1.1 

Example 10.3.2: Data Structures. One of the keys to working with large amounts of information is to organize it in a consistant, logical way. 
A data structure is a scheme for organizing data. A simple example of a data structure might be the information a college admissions depart- 
ment might keep on their applicants. Items in a flat file might look something like this: 

Applicantltem = {FirstName, Middlelnitial, LastName, StreetAddress, City, State, Zip, HomePhone, CellPhone, EmailAddress, 
HighSchool, Major, ApplicationPaid, MathSAT, VerbalSAT, Recommendation 1, Recommendation2, Recommendation3) 

A spreadsheet can be used to arrange data in this way. Although a flat file structure is often adequate, there are often advantages to clustering 
some the information. For example the applicant information is broken into four parts: name, contact information, high school, and application 
data. 

Applicantltem = {{FirstName, Middlelnitial, LastName), {{StreetAddress, City, State, Zip), {HomePhone, CellPhone), EmailAddress), 
HighSchool, {Major, ApplicationPaid, {MathSAT, VerbalSAT), {Recommendation 1, Recommendation2, Recommendation3))) 

The first item in the Applicantltem list is a list {FirstName, Middlelnitial, LastName}, with each item in that list being a single field of the 
original flat file. The third item is simply the single high school item from the flat file. The application data is a list and one of its items, is 
itself a list with the recommendation data for each recommendation the applicant has. 

The organization of this data can be visualized with a rooted tree such as the one in Figure 10.3.3. 
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Figure 10.3.3 

Structured Data 

In general, you can represent a data item, T, as a rooted tree with T as the root and a subtree for each field. Those fields that are more than just 
one item are roots of further subtrees, while individual items have no further children in the tree. 

KRUSKAL'S ALGORITHM 

An alternate algorithm for constructing a minimal spanning tree uses a forest of rooted trees . First we will describe the algorithm in its simplest 
terms. Afterward, we will describe how rooted trees are used to implement the algorithm. Finally, we will describe a simple data structure and 
operations that make the algorithm quite easy to program. In all versions of this algorithm, assume that G = (V ', E, w) is a weighted undi- 
rected graph with |V| = m and \E\= n. 

Kruskal's Algorithm— Version 1. 

(1) Sort the edges of G in ascending order according to weight. That is, 

i < j <^> w(ej) < w(ej). 

(2) Go down the list obtained in Step 1 and add edges to a spanning set (initially empty) of edges so that the set of edges does not form a 
cycle. When an edge that would create a cycle is encountered, ignore it. Continue examining edges until either m - \ edges have been 
selected or you have come to the end of the edge list. If m — 1 edges are selected, these edges make up a minimal spanning tree for G. If 
fewer than m - \ edges are selected, G is not connected. 

Note: Step 1 can be accomplished using one of any number of standard sorting routines. Using the most efficient sorting routine, the time 
required to perform this step is proportional to n log n. The second step of the algorithm, also of n log n time complexity, is the one that uses a 
forest of rooted trees to test for whether an edge should be added to the spanning set. 

Kruskal's Algorithm— Version 2. 

(1) Sort the edges as in Version 1 . 

(2) (2.1) Initialize each vertex in V to be the root of its own rooted tree. 

(2.2) Go down the list of edges until either a spanning tree is completed or the edge list has been exhausted. For each edge 
e = {vj, v 2 ), we can determine whether e can be added to the spanning set without forming a cycle by determining whether the root 
ofvj's tree is equal to the root ofv 2 's tree. If the two roots are equal, then ignore e. If the roots are different, then we can add e to 
the spanning set. In addition, we merge the trees that Vj and v 2 belong to. This is accomplished by either making V\ ' s root the 
parent ofv 2 's root or vice versa. 

Notes: 

(a) Since we start the algorithm with m trees and each addition of an edge decreases the number of trees by one, we end the algorithm 
with one rooted tree, provided a spanning tree exists. 

(b) The rooted tree that we obtain is not the spanning tree itself. 

Mathematica Implementation of Kruskal's Algorithm 

We will implement Kruskal's Algorithm with Mathematica. First we will describe a very simple data structure for representing the forest of 
trees that is maintained within the algorithm. All that is needed is a list of integers, one for each of the vertices in the graph. For simplicity we 
will take the vertices to be the integers 1 , 2, . . . , n and will illustrate with the case of n = 10 . If forest is the list, then 
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forest[ [k]]= 



j if j is k ' s parent in the forest 
if A: is the root of a tree in the forest 



For example, if forest is equal to (0, 1, 0, 6, 3, 3, 4, 0, 3, 3), the vertices 1, 3, and 8 are roots of trees. Figure 10.3.4 shows the forest, 
where directed edges are used to indicate parent hood. An edge from k to j indicates that j is k's parent. Roots are sinks in this representation 
of trees . 




' 



- 



By representing forests in this way, it is easy to program functions that give the root of a tree and merge two trees. Notice the recursive nature 
of the root function. The input cell below has been set to be not evaluatable. These functions will be defined locally within the Kruskal 
algorithm code. 

root [k_] := If [forest [ [k] ] == 0, k, root [forest [ [k] ] ] ] ; 
merge[vl_, v2_] := (forest [ [root [v2] ] ] =root[vl]); 

One way to represent a weighted graph in Mathematica is as a list of pairs {edge, weight) where the edge is a rule. For example, (4 -> 1, 6} 
would be a edge connecting vertices 4 and 1 with weight 6. The way we will program the algorithm, the direction is not important, and the 
graph is presumed undirected. For consistency, we seed the random number generator and then generate a random graph, wg, having 10 
vertices by "flipping a coin" to determine whether each edge is present. We also assign a assign random weight between 4 and 10 to each edge. 
These numbers are arbitrary and can be adjusted, if desired. 

SeedRandom[2010] ; 

wg = Map [{Rule @@fl, Randomlnteger [{4, 10}]} &, 

RandomGraph[10, 0.5] // EdgeList // Select [tt, (FirstfB] <Last[B]) &] &] 

Here is what the graph looks like: 
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graph = GraphPlot [wg, EdgeLabeling -» True, DirectedEdges -» False, VertexLabeling -> True, 
VertexCoordinateRules -» Map [ (B -> {Cos [Pi B/ 5] , Sin [Pi 8 / 5] }) &, Range [10] ] ] 




The following function definition presumes that the input is a weighted undirected graph that is represented in the {edge, weight) format 
described above. The output is a sublist that comprises a minimal spanning tree for the input. 

Kruskal [g_] := 
Module[{nV, vertexset, spanset, edgepool, forest, nextedge, root, merge, index}, 
root[k_] := If [forest[ [k] ] == 0, k, root [forest [ [k] ]]] ; 
merge [vl_, v2_] := (forest [ [root [v2] ] ] =root[vl]); 

vertexset = Union [Flatten [ (List @@8) & /@g [ [All, 1] ] ] ] ; nV = Length [vertexset] ; 
Map [ (index [vertexset [ [tt] ] ] = B) &, Range [nV] ] ; 

forest = Table [0, {nV}]; spanset = {}; edgepool = SortBy[g, Last]; 
While [ (Length [spanset] < nV - 1) && (edgepool + {}) , 
nextedge = First [edgepool] ; edgepool = Rest [edgepool] ; 
If [root [index [nextedge [ [1, 1]]]] + root [index [nextedge [ [1, 2 ] ] ] ] , AppendTo [ 

spanset, nextedge] ; merge [index [nextedge [ [1, 1]]], index [nextedge [ [1, 2]]]]]]; 
If [Length [spanset] ==nV-l, spanset, "Graph not connected - no spanning tree exists."]] 

We use the function to generate a spanning tree for our example: 



st = Kruskal [wg] 


i 1 ->2 
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Lets take a look at the spanning tree, with edges drawn in orange. 
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tree = GraphPlot [st, EdgeLabeling -» True, DirectedEdges -» False, VertexLabeling -> True, 
VertexCoordinateRules -» Map[ (B -> {Cos [Pi tt/5], Sin[Pitt/5]}) &, Range [10] ] , 
EdgeRenderingFunction -> ({Orange, Thickness [0.01] , Arrowheads [0] , Arrowftt]} &) ] ; 

Show [{tree, graph}] 




Example 10.3.3. Lets look at how long it take to complete the algorithm for a graph with 200 edges. Drawing the graph isn't of interest, 
it's mostly the time to generate the spanning tree we are interested in. 

SeedRandom[2010] ; 

largegraph = Map[ {Rule @@n, Randomlnteger [{4, 10}]} &, 

RandomGraph[200, 0.5] // EdgeList // Select [B, (First[8] < Last [tt] ) &] &] ; 

The time will vary depending on the computer you are using. The expression First[Timing[caIculation;]] will perform a calculation and 
just display the CPU time needed, in seconds, to complete the calculation. 

{First [Timing [largetree = Kruskal [largegraph] ;]] , $System} 

{0.579841 , Mac OS X x86 (64-bit)) 
Just for fun, here is what the spanning tree looks like. 
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GraphPlot [largetree, EdgeLabeling -> False] 




Example 10.3.4. Lets conclude with an example that is based on some real data, using Wolfram's Computable database of city data. 
We will build a minimal spanning tree for the large cities of France, where "large" is taken to mean a population of 100,000 or more. The 
weights of edges between cities will the distance between them. First here is that list of cities. By the way, you can change "France" to any 
other country or region and get similar results . 
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citylist = CityData [{Large, "France"}] 



Paris 


IleDeFrance 


France 


Marseille 


ProvenceAlpesCoteDAzur 


France 


Lyon 


RhoneAlpes 


France 


Toulouse 


MidiPyrenees 


France 


Nice 


ProvenceAlpesCoteDAzur 


France 


Nantes 


PaysDeLaLoire 


France 


Strasbourg 


Alsace 


France 


Montpellier 


LanguedocRoussillon 


France 


Bordeaux 


Aquitaine 


France 


Lille 


NordPasDeCalais 


France 


Rennes 


Bretagne 


France 


LeHavre 


HauteNormandie 


France 


Reims 


ChampagneArdenne 


France 


SaintEtienne 


RhoneAlpes 


France 


Toulon 


ProvenceAlpesCoteDAzur 


France 


Grenoble 


RhoneAlpes 


France 


Angers 


PaysDeLaLoire 


France 


Dijon 


Bourgogne 


France 


Brest 


Bretagne 


France 


LeMans 


PaysDeLaLoire 


France 


Nimes 


LanguedocRoussillon 


France 


AixEnProvence 


ProvenceAlpesCoteDAzur 


France 


Limoges 


Limousin 


France 


ClermontFerrand 


Auvergne 


France 


Tours 


Centre 


France 


Amiens 


Picardie 


France 


Villeurbanne 


RhoneAlpes 


France 


Metz 


Lorraine 


France 


Besancon 


FrancheComte 


France 


Perpignan 


LanguedocRoussillon 


France 


Orleans 


Centre 


France 


Rouen 


HauteNormandie 


France 


Mulhouse 


Alsace 


France 


Caen 


BasseNormandie 


France 


oulogneBillancourt 


IleDeFrance 


France 


Nancy 


Lorraine 


France 


Montreuil 


IleDeFrance 


France 


Argenteuil 


IleDeFrance 


France 



As part of the geographic database, there is a functional called GeoDistance that we will use to define a function that tells us how many 
kilometers separate any two cities . 

CityDistance [cityl_, city2_] := 
GeoDistance [CityData [cityl , "Coordinates"], CityData [city2 , "Coordinates"]] / 1000 

Next, we generate the weighted graph. 

g = Map [ {First [B[ [1] ] ] -» First [B[ [2] ] ] , CityDistance @@ 8} &, Tuples [citylist, 2] ] // 
Select[B, Last [8] > &] &; 

In France there are large cities and drawing the K x produces a largely uninteresting figure. By drawing only the edges for cities within 300 
km of one another, the figure is a little more interesting. The cluster in the north central part of France is greater-Paris, where several other 
adjacent cities are also large. We use the longitude and latitude of each city to plot a location on the plane. 
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GraphPlot[g // Select [B, Last [tt] < 300 &] &, EdgeLabeling -> False, 
VertexCoordinateRules -» Map [ (First [tt] -» Reverse [CityData [tt, "Coordinates"]]) &, citylist] ] 




Now we use our Kruskal function to generate a spanning tree. Notice the first few edges in the list are of very close cites. 
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span = Kruskal [g] 

Villeurbanne -> Lyon 4.04522 

Montreuil -> Paris 6.6039 

BoulogneBillancourt -» Paris 8.06249 

Argenteuil -» Paris 1 1 .5285 

Marseille -» AixEnProvence 25 .09 1 

Montpellier -> Nimes 46.3561 

Nancy ^ Metz 47.825 

Marseille -> Toulon 48.9574 

LeHavre -> Caen 49.043 

Lyon -> SaintEtienne 50.2382 

Rouen -> LeHavre 69.9004 

Besancon -» Dijon 75.559 

Tours -> LeMans 78 . 1 345 

LeMans -> Angers 80.1425 

Angers -> Nantes 82.6313 

Villeurbanne -> Grenoble 92.0243 

Nimes -» AixEnProvence 94.3932 

Mulhouse -» Strasbourg 96.3815 

Rennes -» Nantes 98.1899 

Amiens -» Lille 98.9384 

Argenteuil -» Rouen 101.2 

Rouen -» Amiens 101.847 

Orleans -» BoulogneBillancourt 106.438 

Tours -> Orleans 107.746 

SaintEtienne -» ClermontFerrand 109.348 

Besancon -» Mulhouse 115.034 

Strasbourg -» Nancy 117.825 

Reims -> Montreuil 124.727 

Toulon -> Nice 126.35 

Montpellier -> Perpignan 128.738 

ClermontFerrand -» Limoges 142.36 

Toulouse -» Perpignan 155.437 

Metz^ Reims 157.396 

Villeurbanne -> Dijon 173.794 

SaintEtienne -» Nimes 176.717 

Bordeaux -» Limoges 180.776 

Brest -» Rennes 211.714 

Here are a couple of graphs of the spanning tree. The first shows city names, but would need to be blown up to see them all. 
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GraphPlot [span, EdgeLabeling -> False, VertexLabeling -» True, 
VertexCoordinateRules -» Map [ (First [B] -» Reverse [CityData [tt, "Coordinates"]]) &, citylist] ] 



Caen 

Bol Argente 

Rennes LeMans ^^ 

Nantes 111£erS T ™ rs 



:l -^ Strasbourg 



Dijon 



Limoges lermontFerrand 



Toulouse Mon ; n Pr™ ^p IN ice 

"^ ^ Toulon 

Perpignan 

The same graph with VertexLabeling shut off shows the tree's structure. 

GraphPlot [span, EdgeLabeling -» False, VertexLabeling -» False, 
VertexCoordinateRules -» Map [ (First [B] -» Reverse [CityData [tt, "Coordinates"]]) &, citylist]] 




EXERCISES FOR SECTION 10.3 
A Exercises 

1 . Suppose that an undirected tree has diameter d and that you would like to select a vertex of the tree as a root so that the resulting rooted tree 
has the smallest depth possible. How would such a root be selected and what would be the depth of the tree (in terms of d)l 

2. Use Kruskal's algorithm to find a minimal spanning tree for the following graphs. In addition to the spanning tree, find the final rooted tree 
in the algorithm. When you merge two trees in the algorithm, make the root with the lower number the root of the new tree. 

(a) 
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(b) 




C Exercises 

3. Suppose that information on buildings is arranged in records with five fields: the name of the building, its location, its owner, its height, and 
its floor space. The location and owner fields are records that include all of the information that you would expect, such as street, city, and state, 
together with the owner's name (first, middle, last) in the owner field. Draw a rooted tree to describe this type of record, 

4. (Requires Mathematical Given a country with n large cities evenly distributed in an area of A square kilometers, one would expect that the 
total length of the spanning tree for the cities would roughly depend on n and A. In the case of France, n = 37 cities, A = 547030 km - , and the 
length of the spanning tree is 350 1 .59 km. Look for such a relationship by collecting similar data for other countries or generating random data. 
To get areas of countries you can use the CountryData function. 

CountryData [ "France" , "Area" ] 

547030. 

Caution: Oddly shaped countries such as Chile, or countries that have large uninhabited areas such as Brazil will probably not fit any proposed 
model. 



I 10.4 Binary Trees 



An ordered rooted tree is a rooted tree whose subtrees are put into a definite order and are, themselves, ordered rooted trees. An empty tree and 
a single vertex with no descendants (no subtrees) are ordered rooted trees . 

Example 10.4.1. The trees in Figure 10.4.1 are identical rooted trees, with root 1 , but as ordered trees, they are different. 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial-No 



20 f^lO.nb 



239 



(a) 



(b) 





10 



10 



Figure 10.4.1 

Two different ordered rooted trees 



;P 



subtrees. If we restrict the number of subtrees of each 



If a tree rooted at v has p subtrees, we would refer to them as the first, second, 
vertex to be less than or equal to two, we have a binary (ordered) tree. 

Definition: Binary Tree. A binary tree is 

(a) a tree consisting of no vertices (the empty tree), or 

(b) a vertex with two subtrees that are both binary trees. The subtrees are called the left and right subtrees. 

The difference between binary trees and ordered trees is that every vertex of a binary tree has exactly two subtrees (one or both of which may 
be empty), while a vertex of an ordered tree may have any number of subtrees. The two trees in Figure 10.4.2 would be considered identical as 
ordered trees; however, they are different binary trees. Tree (a) has an empty right subtree and Tree (b) has an empty left subtree. 



(a) 



(b) 




Terminology and General Facts: 

(a) A vertex of a binary tree with two empty subtrees is called a leaf. All other vertices are called internal vertices. 

(b) The number of leaves in a binary tree can vary from one up to roughly half the number of vertices in the tree (see Exercise 4 of this 
section). 

(c) The maximum number of vertices at level k of a binary tree is 2 k , k > (see Exercise 6 of this section) . 

(d) A full binary tree is a tree in which each vertex has either zero or two empty subtrees . In other words , each vertex has either two or zero 
children. See Exercise 7 of this section for a general fact about full binary trees. 

TRAVERSALS OF BINARY TREES 

The traversal of a binary tree consists of visiting each vertex of the tree in some prescribed order. Unlike graph traversals, the consecutive 
vertices that are visited are not always connected with an edge. The most common binary tree traversals are differentiated by the order in which 
the root and its subtrees are visited. The three traversals are best described recursively and are: 

(1) Preorder Traversal: 

(a) Visit the root of the tree. 

(b) Preorder traverse the left subtree. 

(c) Preorder traverse the right subtree. 

(2) Inorder Traversal: 

(a) Inorder traverse the left subtree. 

(b) Visit the root of the tree. 

(c) Inorder traverse the right subtree. 

(3) Postorder Traversal: 

(a) Postorder traverse the left subtree. 

(b) Postorder traverse the right subtree. 

(c) Visit the root of the tree. 
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Any traversal of an empty tree consists of doing nothing. 

Example 10.4.2. For the tree in Figure 10.4.3, the orders in which the vertices are visited are: 
A-B-D-E-C-F-G, for the preorder traversal 
D-B-E-A-F-C-G, for the inorder traversal and 
D-E-B-F-G-C-A, for the preorder traversal. 




Figure 10.4.3 

Example 10.4.3: Binary Tree Sort. Given a collection of integers (or other objects than can be ordered), one technique for sorting is a 
binary tree sort. If the integers are a\ , a%, , , , , a n , n> 1 , we first execute the following algorithm that creates a binary tree: 

(1 ) Insert a\ into the root of the tree. 

(2) For k := 2 to n //insert a k into the tree 

(2J)r = a lt 

(2 .2) inserted = false 

(2.3) While Not( inserted) Do 
Ifa k <r then 

ifr has a left child 

then r = left child ofr 

else make a k the left child ofr; inserted = true 

else II a k > r 

ifr has a right child 

thenr = right child of r 

else make a k the right child ofr; inserted = true 

If the integers to be sorted are 25, 17, 9, 20, 33, 13, and 30, then the tree that is created is the one in Figure 10.4.4. The inorder traversal of this 
tree is 9, 13, 17, 20, 25, 30, 33, the integers in ascending order. In general, the inorder traversal of the tree that is constructed in the algorithm 
above will produce a sorted list. The preorder and postorder traversals of the tree have no meaning here. 

Figure 10.4.4 

EXPRESSION TREES 

A convenient way to represent an algebraic expression is by its expression tree. Consider the expression 

X = a*b - c/d + e. 

Since it is customary to put a precedence on multiplication/divisions, X is evaluated as ((a*b) - (c/d)) + e. Consecutive multiplication/divi- 
sions or addition/subtractions are evaluated from left to right. We can analyze X further by noting that it is the sum of two simpler expressions 
(a*b) - (c/d) and el. The first of these expressions can be broken down further into the difference of the expressions a*b and c/d. When we 
decompose any expression into 
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(left expression) (operation) (right expression), 

the expression tree of that expression is the binary tree whose root contains the operation and whose left and right subtrees are the trees of the 
left and right expressions, respectively. Additionally, a simple variable or a number has an expression tree that is a single vertex containing the 
variable or number. The evolution of the expression tree for expression X appears in Figure 10.4.5. 



* 




Figure 10.4.5 



Example 10.4.4. 



(a) If we intend to apply the addition and subtraction operations in X first, we would parenthesize the expression to a * (b - c)/(d + e). Its 
expression tree appears in Figure 10.4.6a. 

(b) The expression trees for a 2 - b 2 and for (a + b) * (a - b) appear in Figures 10.4.6(b) and 10.4.6(c). 
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(a) 




d e 



(b) 



/ 




Figure 10.4.6 

The three traversals of an operation tree are all significant. A binary operation applied to a pair of numbers can be written in three ways. One is 
the familiar infix form, such as a + b for the sum of a and b. Another form is prefix, in which the same sum is written +a b. The final form is 
postfix, in which the sum is written ab +. Algebraic expressions involving the four standard arithmetic operations (+, -, *, and /) in prefix 
and postfix form are defined as follows: 

Prefix: (a) A variable or number is a prefix expression, (b) Any operation followed by a pair of prefix expressions is a prefix expression. 
Postfix: (a) A variable or number is a postfix expression, (b) Any pair of postfix expressions followed by an operation is a postfix expression. 
The connection between traversals of an expression tree and these forms is simple: 

(a) The preorder traversal of an expression tree will result in the prefix form of the expression. 

(b) The postorder traversal of an expression tree will result in the postfix form of the expression. 

(c) The inorder traversal of an operation tree will not, in general, yield the proper infix form of the expression. If an expression requires 
parentheses in infix form, an inorder traversal of its expression tree has the effect of removing the parentheses. 

Example 10.4.5. The preorder traversal of the tree in Figure 10.4.5 is + -*ab/cde, which is the prefix version of expression X. The postfix 
traversal is ab*c dj -e +. Note that since the original form of X needed no parentheses, the inorder traversal, a *b - c/d + e, is the correct 
infix version. 

COUNTING BINARY TREES 

We close this section with a formula for the number of different binary trees with n vertices . The formula is derived using generating functions . 
Although the complete details are beyond the scope of this text, we will supply an overview of the derivation in order to illustrate how generat- 
ing functions are used in advanced combinatorics. 

Let B (n) be the number of different binary trees of size n (n vertices), n > 0. By our definition of a binary tree, B (0) = 1. Now consider any 
positive integer n + 1, n > 0. A binary tree of size n + 1 has two subtrees, the sizes of which add up to n. The possibilities can be broken 
down into n + 1 cases: 

Case 0: Left subtree has size 0; right subtree has size n. 
Case 1: Left subtree has size 1; right subtree has size n - 1. 



Case k: Left subtree has size k; right subtree has size n 
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Case n: Left subtree has size n; right subtree has size 0. 

In the general Case k, we can count the number of possibilities by multiplying the number of ways that the left subtree can be filled, B (k), by 
the number of ways that the right subtree can be filled. B(n - k). Since the sum of these products equals B (n + 1), we obtain a recurrence 
relation for n > 0: 

B(n + 1) = B(Q)B(n) + B(l)S(n - 1) + ■■• + B(n)B(Q) 
= ^B(k)B(n-k) 

k=0 

Now take the generating function of both sides of this recurrence relation: 
£ B(n + 1) z" = E ( E B(k) B(n - k)\ z" 

n=0 n=0 U = 



G(BT; z) = G(B*B; z) = G(B; z) 



Recall that G(B T; z) = G{B ' Z \ g(0) = G(fe) ' If we abbreviate G (S; z) to G, we get 

— =G 2 ^ zG 2 -G+l=0 
Using the quadratic equation we get two solutions: 



G\ = — and Gi 



The gap in our deviation occurs here since we don't presume calculus at the level needed in the next step. If we expand G\ by its Laurent series, 
we find 

= i±^ll = l_ 1 _ z _ 2z 2_ 5z 3_ 14 ,4_ 42z 5 + ... 



The coefficients of the powers of all z are all negative and there is singularity at because of the - term. However if we do the same with G^ 
we get 



VI 4 Z = j + 2 2 +5 3 + 14 4 +42 5 + ... 

2z 

Further analysis leads to a closed form expression for B («), which is 
l i In' 



B(n) 

"+i \ n 

This sequence of numbers is often called the Catalan numbers. For more information on the Catalan numbers, see the entry A000108 in The 
On-Line Encyclopedia of Integer Sequences. 



# 



Mathematics Note 



It may be of interest to note how the Laurent expansions of G\ and Gz are determined using Mathematica. The function Series will generate 
a finite number of terms. For example for Gi we evaluate the expression 

r l+ Vl-4z , 

Series , {z, 0, 5} 

1 

- - 1 - z - 2z 2 - 5 £ - 14/ - 42z 5 + 0(z 6 ) 

7 

The first argument is the closed form expression for G\. The second argument has three parts. The first part is the variable. The second part is 
the center of the expansion , , since we want a sum in powers of z to get the coefficients of those powers . The third part is the maximum power 
of the variable. Notice that the output also include the term 0(z 6 ) to indicate that the remaining terms are all multiples of z 6 ■ If we wanted the 
first three terms of the expansion around - 1 , which is a sum of powers of z + 1 , we would get 
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r l + Vl -4z , 

Series , {z, - 1, 3} 

1 V5~l ( 1 3 'l f 1 13 "1 f 1 61 

2 2 



+ 1- |fe + l) + |- |(z+D 2 + 



2V5~J I 2 10V5~J V 2 50V5~J 



(Z + l) 3 + 0((z + l) 4 ) 



I Sage Note 



In Sage, one has the capability of being very specific about how algebraic expressions should be interpreted. This also makes working with 
various algebraic expressions a bit more confusing to the beginner. Here, without getting into a lot of detail, is how to get a Laurent expansion 
for G\ above. 

sage : R.<z>=PowerSeriesRing(ZZ, ' z ' ) 

sage: G=(l+sqrt(l-4*z) )/ (2*z) 

sage : G 

z A -l - 1 - z - 2*z"2 - 5*z"3 - 14*z"4 - 42*z"5 - 132*z"6 - 429*z"7 - 1430*z"8 - 4862*z"9 - 

16796*z~10 - 58786*z"ll - 208012*z"12 - 742900*z"13 - 2674440*z"14 - 9694845*z" 15 - 

35357670*z"16 - 129644790*z"17 - 477638700*z" 18 + 0(z"19) 

The first output above declares a structure called a ring that contains power series. Here we are not using that structure, just a specific element, 
G. So the important thing about this first input is that it establishes z as being a variable associated with power series. When the second 
expression defines the value of G in terms of z, is automatically converted to a power series. Notice that the final output does match the 
Mathematica result, but with more terms displayed. In Chapter 16 we will introduce rings and will be able to take advantage of Sage's 
capabilities in this area and this will probably make more sense. 

EXERCISES FOR SECTION 10.4 
A Exercises 

1 . Draw the expression trees for the following expressions: 

(a) a (b + c) 

(b) ab + c 

(c) ab + ac 

(d) bb - Aac 

(e) ((03 x + a-i) x +ai)x + Oq 

2. Draw the expression trees for 

(a) — 

X— 1 

(b) xy + xz + yz 

3. Write out the preorder, inorder, and postorder traversals of the trees in Exercise 1 above. 

4. Verify the formula for B(k), < k < 3 by drawing all binary trees with three or fewer vertices. 

5 . (a) Draw a binary tree with seven vertices and only one leaf. 

(b) Draw a binary tree with seven vertices and as many leaves as possible. 

B Exercises 

6. Prove that the maximum number of vertices at level k of a binary tree is 2 k and that a tree with that many vertices at level k must have at 
least 2* +1 - 1 vertices. 

7. Prove that if T is a full binary tree, then the number of leaves of T is one more than the number of internal vertices (nonleaves). 

8 . Use Mathematica to determine the sequence whose generating function is G(z) = j 
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SUPPLEMENTARY EXERCISES FOR CHAPTER 10 

Section 10.1 

1 . Show that in a tree with n vertices, the sum of the degrees of all vertices is 2 n - 2. 

2. (a) Show that there exists a tree with ten vertices and the property that each vertex has degree either 1 or 5. 
(b) Prove that no such tree exists with an odd number of vertices. 

3. Given G = (V, E) with |V| = v and \E\ = e, G is graceful if the elements in V can be labeled with v distinct positive integers such that for each 
positive integer k, 1 < k < e, there is an edge connecting vertices i and j such that \i - j\ = k. 

(a) Which of the following graphs are graceful? 



C 2 




V'l V'2 V'3 

VI 

G 3 G 4 

V'3 V 4 V2 V4 




(b) Prove that every chain (see Figure 9.6.1) is graceful. It has been conjectured that every tree is graceful. 

Section 10.2 

4. Let G be the graph 




Find a minimal spanning tree for G using the method of left and right sets. Start with R = {a} and L = [b, c, d, e). At each step show what L 
and R are and indicate what edge you added to the tree. 

Section 10.3 

5. Find a minimal spanning tree for the graph below. Use Kruskal's algorithm and draw the forest of rooted trees after you have added the third 
edge to the spanning set. 
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6. If a rooted tree has the properties that each vertex has no more than m children and its depth is less than or equal to n, how many vertices 
could it have? 

Section 10.4 

7. Represent the algebraic expression (((a - b)*c) + 7)*((d + 4)/x) by a tree. Determine the depth of the tree. 

8. (a) Write out the post order traversal for the graph below. 

(b) Write out the inorder traversal for the graph below. 

(c) Write out the preorder traversal for the graph below . 



9. (a) Draw the operation tree for the expression a 2 + 2 ab + b, where all multiplications are done first, x 2 = xx, and additions are done from left 
to right (as usual) . 

(b) List the postorder traversal of the tree that you obtained in part a. What is the significance of this traversal? 

10. (a) Draw the binary tree that would be constructed by the binary sort algorithm for sorting the integers 5, 55, 34, 38, 11, 3, 71, 23, and 18 
in descending order. Afterwards, list the preorder traversal of the tree and then build a sorting tree from that order. Repeat the process above, 
but do an inorder traversal instead of preorder. 

(b) Based on the results of part a, what can you say about the process of building a binary sorting tree, storing it in preorder (or inorder), and 
then building a tree from that new list? 
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Solutions and Hints to Odd-Numbered Exercises 



CHAPTER 1 
Section 1 .1 

l.(a) 8, 15, 22, 29 

(b) apple, pear, peach, plum These solutions are not unique. 

(c) 1/2, 1/3, 1/4, 1/5 

(d) -8, -6, -4, -2 
(e)6, 10, 15, 21 

3.(a) {2k + 1 :k e Z, 2 =s k =s 39} (b) {x eQ : -1 < x < 1} 

(c) (2n:neZ) (d)|9n:neZ,-2sn| 

5 .(a) True (b) False (e) True (d) True (e) False 

(fj True (g) False (h) True 

Section 1 .2 

l.(a){2, 3} (b){0, 2, 3) (c) {0,2,3} (d){0, 1, 2, 3, 5, 9} 

(e){0) (f)0 (g){ 1,4, 5,6, 7, 8,9} (h) {0, 2, 3, 4, 6, 7, 8) 

(i) (j) (0) 

3. These are all true for any sets A, B, and C. 

5.(a){l, 4) c A c {1, 2, 3, 4} 

(b) (2) c A c {1, 2, 4, 5) (c) A = {2, 4, 5} 

7. 





D 




> ) 
/ \J 


<*> 




\ 





w 





9. (a) 
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? Select 



Select[tof, crit] picks out all elements e, of list for which crit[ei\ is True. 
Select[/wf, a-it, n] picks out the first n elements for which crit[ej] is True. » 

? PrimeQ 



PrimeQ[e.vpr] yields True if expr is a prime number, and yields False otherwise. » 

(b) 

Select [Range [2000, 2099], PrimeQ[tt] &] 

{2003, 2011, 2017, 2027, 2029, 2039, 2053, 2063, 2069, 2081, 2083, 2087, 2089, 2099) 

Section 1.3 

l.(a){(0, 2), (0, 3), (2, 2), (2, 3), (3, 2), (3, 3)} 

(b){(2, 0), (2, 2), (2, 3), (3, 0), (3, 2), (3, 3)} 

(c) ((0, 2, 1), (0, 2, 4), (0, 3, 1), (0, 3, 4), (2, 2, 1), (2, 2, 4), (2, 3, 1), (2, 3, 4), (3, 2, 1), (3, 2, 4), (3, 3, 1), (3, 3, 4)) 

(d)0 

(e) {(0, 1), (0, 4), (2, 1), (2, 4), (3, 1), (3, 4)) 

(f){(2, 2), (2, 3), (3, 2), (3, 3)) 

(g) ((2, 2, 2), (2, 2, 3), (2, 3, 2), (2, 3, 3), (3, 2, 2), (3, 2, 3), (3, 3, 2), (3, 3, 3)) 

(h){(2, 0), (2, {2}), (2, (3)), (2, {2, 3)), (3, 0), (3, {2}), (3, (3)), (3, {2, 3))) 

3.(a, b), {a, c), (a, rf}, \b, c), {b, d), (c, d) 

5. There are n singleton subsets, one for each element. 

7. (a) {+00, +01, +10, +11, -00, -01, -10, -11} (b) 16and512 

9. When A = B 

Section 1 .4 

l.(a) 11111 (b) 100000 (c) 1010 (d) 1100100 

3. (a) 18 (b) 19 (c) 42 (d) 1264 

5 There is a bit for each power of 2, starting with the zeroth power. The number 

1990 is between 2 10 = 1024 and 2 11 = 2048, so there are 10 + 1 (the power 
of 2) bits. 

(a) 11 (b) 12 (c) 13 (d) 8 
7. It must be a multiple of four. 

Section 1 .5 

l.(a) 24 (b) 6 (c) 3,7,15,31 (d)l,4, 9, 16 

... 1 1 1 In 

3. (a) H + v ■■■ -\ = — 

1(1 + 1) 2(2+1) 3(3+1) k(h+I) «+l 

., . 1 1 1 1113 3 

(b) H + =-+-+—=-= 

1(2) 2(3) 3(4) 2 6 12 4 3+1 



(c)l +2 3 + 3 3 + — +n 3 =(-)n 2 (n+ l) 2 

1+4 + 27 = 36 (-) (3) 2 (3 + 1) 2 = 36 

5.(x + >')" = ( ",)*" + ( " l )x"-'y + ('{)x"- 2 y 2 + ••• + ( ;;_,)!)>"-' +( "Jy" 
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7.(a){xeQ|0<x«5) (b) 

(c){*eQ|-5<jc<5}=B 3 (d) {x eQ | -1 <x< 1} =B t 
9 .(a) 36 (b) 105 

Supplementary Exercises— Chapter 1 

l.(a){2, 1) (b)0 (c){i,-i} 

3. (a) {0, 3,4, 5,6, 7,8,9) (b) (3, 6, 9) (c) (0,1,2) 

5. (a) A US = {1,2, 3,4,5,6,7,9, 12} => |(A U B)\ = 9 
\A\ = 6, \B\ = 5 A n 5 = (2, 5) => |(A f] B)\ = 2 
|A| + |S| - |(A n B)l = 6 + 5 - 2 = 9 

(b) 10,8,2 

(c) (A U S U O = |((A U B) U Ol 

by part (a) = |(A U 5)1 + \C\ - |((A U B) D Ol 
Distributive = |(A U 5)1 + |C| - |((A f| O U (J» D 0)1 
by part (a) = |A| + |B| - |(A fl B)l + \C\ - [|(A fl Ol 

+ KB n oi - k(a n o n (b n o)u 

Simplify = |A| + \B\ + \C\ - \{A f] B)\ - \{A ft C)\ 

-\(Bf]C)\ + \(Af]Bf]C)\ 
7. (a) {(4, 4)) 

(b) {(2, 4), (4, 4), (6, 4)) 

(c) {(4,4,4)) 

(cl) {(4, 2), (4, 4), (4, 6)) 

(e) {(2, 4, 1), (2, 4, 5), (4, 4, 1), (4, 4, 5), (6, 4, 1), (6, 4, 5)} 

15. (a) A = {0), A, ={0, 1,2,3), A 2 = {0, 1,2, ...,6), A3 = {0, 1,2, ..., 9) 

(b) (0,1,2,3) 

(c) {0) 

(d) {0,1,2,..., 9) 

(e) {0,1,2,..., 9) 

17. Parts a, b, and d are true with multiplication replacing addition. 

CHAPTER 2 
Section 2.1 

1 . If there are m horses in race 1 and n horses in race 2 then there are m ■ n possible daily doubles. 

3.72 = 4-6-3 

5.720 = 6-5-4-3-2-l 

7 . If we always include the blazer in the outfit we would have 6 outfits . If we 

consider the blazer optional then there would be 12 outfits. When we add a 

sweater we have the same type of choice. Considering the sweater optional 

produces 24 outfits. 

9. (a) 2 8 = 256 (b) 2 4 = 16. Here we are concerned only with the first four 
bits, since the last four must be the same. 

(c) 2 7 = 128, you have no choice in the last bit. 
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11. (a) 16 


(b)30 


13. (a) 3 


(b)6 


15. 18 




17. (a) 






(b)5 6 

19.2"-' - land 2" -2 

Section 2.2 

l.P(1000;3) 

3. With repetition: 26 8 » 2.0883 x 10 1 ' 

Without repetition: P(26; 8) « 6.2991 x 10 10 

5. 15! 

7.(a)P(15;5) = 360360 (b) 2- 14- 13- 12-11 =48048 

9.2-P(3;3)= 12 

1 1 . (a) P(4; 2) = 12 (b) P(n; 2) = n(n - 1) 

(c) Case 1: m > n. Since the coordinates must be different, this case is impossible. 

Case 2: m ^ n.P(n; m). 

Section 2.3 

I. {{a}, {ft}, {c}}, {{a. ft}, {c}}, {{a, c}. {6}}, {{a), {b, c}}, {{a, b, c}} 

3. No. By this definition it is possible that an element of A; might not belong to A. 

5. The first subset is all the even integers and the second is all the odd integers. 

These two sets do not intersect and they cover the integers completely. 

7. Let J, A, N stand for the set of people who jog, do aerobics, and do Nautilus. 

Then, using law of addition #2, 

90 = 30 + 30 + 30 - 25 - 20 - 10 + |(7 n A f] N)\. 

9. Assume |(A, U A 2 )| = |A,| + |A 2 | - |(A, f] A 2 )|. 

|(A 1 UA 2 UA 3 )| = |((A 1 UA 2 )UA 3 )| 

by 1st law = |(A] U A 2 )| + \A 3 \ - |((A, U A 2 ) f] A 3 )\ 

Distributive = |(A, U A 2 )| + \A 3 \ - |((A, R A 3 ) U (A 2 fl A 3 ))| 

1st law (twice) = |A,| + |A-,| - |(Ai fl A 2 )| + |A 3 | 

-[ka, n a 3 )i + i(A 2 n a 3 )i - k(a, n a 3 ) n (a 2 n a 3 ))u 
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Simplify = |A,| + \A 2 \ + |A 3 | - |(A, D A 2 )| - |(A, D A,)| 

-KAzflAs^ + KAjnAjnAa)!. 

(b) l(A] U A 2 U A 3 U A 4 )| = |A,| + |A 2 | + |A 3 | + IA4I - |(A, f] A 2 )\ 

-i(A, n a 3 )i - ka, n a,)i - ka 2 n a 3 )i - ka 2 n a 4 )i 

-i(A 3 n a 4 )i + ka, n a 2 n a 3 )i + ka, n a 2 n aoi 

+ka, n a 3 n a,)i + ka 2 n a 3 n a 4 )i - Ui n a 2 n a 3 n a 4 )i 

Derivation: 

KA] U A 2 U A 3 U A 4 )| = |((A, U A 2 U A 3 ) U A 4 )| 

1st law = |(Aj U A 2 U A 3 )| + |A 4 | - |«Ai U A 2 U A 3 ) fl A,)l 

Distributive = |(A, U A 2 U A 3 )| + |A 4 | - |((Ai R A,) U (A 2 D A4) 

U (A3 n A 4 ))| 

2nd law (twice) = |A,| + |A 2 | + |A 3 | - |(A, R A 2 )| - |(A] R A 3 )| 

-ka 2 n a 3 )i + ka, n a 2 n a 3 )i + ia,i - 1^ n a 4 )i 
+ ka 2 n a 4 )i + ka 3 n a 4 )i - k(a, n a,) n (a 2 n a 4 ))i 
- i«A, n a 4 ) n (a 3 n a,))i - i«a 2 n ao n (a 3 n a 4 ))i 
+ i«A, n a 4 ) n (a 2 n a,) n (a 3 n mm 

Simplify = |A,| + |A 2 | + |A 3 | + |A 4 | - |(A, R A 2 )| - KA] f] A 3 )| 

-ka 2 n a 3 )i - KA] n a 4 )i - ka 2 n a 4 )i - ka 3 n a 4 )i 
+i(a, n a 2 n a 3 )i + kax n a 2 n a 4 )i + ^ n a 3 n aoi 
+ka 2 n a 3 n a 4 )i - ka, n a 2 a 3 n a 4 )i 

1 1 . Hint: Partition the set of fractions into blocks, where each block contains fractions that are numerically equivalent. Describe how you would 
determine whether two fractions belong to the same block. Redefine the rational numbers to be this partition. Each rational number is a set of 
fractions . 

Section 2.4 

l.C(10;3)-C(25;4) = 1,518,000 

3. C(10; 7) + C(10; 8) + C(10; 9) + C(10; 10) 

5. 16x 4 -96x 3 y + 216x 2 y 2 -216xy 3 + 81y A 
7. (a) C(52;5) = 2, 598,960 

(b) C(52; 5) • C(47; 5) ■ C(42; 5) • C(37; 5) 
9.C(4;2)C(48;3) 
ll.C(12;3)-C(9;4)-C(5;5) 
13. (a) C(10; 2) = 45 (b) C(10; 3) = 120 

15 . Assume \A\ = n. If we let x = y = 1 in the Binomial Theorem, we obtain 
2" = C(n; 0) + C(n\ 1) + • • • + C(n; n), and as a consequence of Example 
2.4.7 we realize that the right side of this equation says the sum of all subsets 

of A. Hence \P(A)\ = 2 W 
17. 999,400,119,992. 

Supplementary Exercises— Chapter 2 

l.(a) 10-9-8 = 720 
(b) 10-10-10= 1000 
3. (a) 
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3 (e, 3) 


e — 


2 (e, 2) 




1 (e, 1) 
3 (d, 3) 


d — 


2 (d, 2) 




1 (d, 1) 
3(c,3) 


c — 


2(c,2) 
1 (c, 1) 
3(b,3) 


b — 


2(b,2) 
1 (b, 1) 
3 (a, 3) 


a — 


2 (a, 2) 
1 (a, 1) 



15 pairs 



(b) If you imagine drawing a tree diagram for this general case, from the starting point, there will be m branches, one for each element of A. 
From the end of each of the "A branches " there will be n branches , one for each element of B . Therefore , there are m ■ n pairs in A x B . 

5 . (a) If couple A is seated, couple B can be either to their left or right and couple C sits in the other position; therefore, there are two possible 
arrangements . 

(b) 2-2 3 =48 

7. (a) 5 != 120 (b) 5 ! - 2-4 != 72. (Here, we subtract the ways that the 

two could be seated together from the total number of arrangements .) 

9.(a)P(10;4) (b)C(10;4)-C(6; 3) 

ll.C(10;2)-8 = 360 

13.(a)C(ll;5) = 462 (b)C(10;4) = 210 

(c) C(2;l)-C(9;4) + C(9;3) 
15.(a)3P(3;2)= 18 (b) 2P(3; 2) = 12 
CHAPTER 3 

Section 3.1 

l.(a) df\c (b)jV-.c 

(c) i(d/\s) (d)i(Aic 

3. (a) 2 > 5 and 8 is an even integer. False. 

(b) If 2 ;£ 5 then 8 is an even integer. True. 

(c) If 2 ^ 5 and 8 is an even integer then 1 1 is a prime number. True. 

(d) If 2 « 5 then either 8 is an even integer or 1 1 is not a prime number. True. 

(e) If 2 ^ 5 then either 8 is an odd integer or 1 1 is not a prime number. False. 
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(f) If 8 is not an even integer then 2 > 5 . True. 
5 . Only the converse of d is true. 

Section 3.2 



p pV p 


p ^p pAp 


p ->p phirp) 


1- (a) 


(b) 1 


(c) 1 1 


1 1 


1 


1 1 


p pAp 






(d) 






1 1 







3.(a)-.(pA«) V s (b)(pV«) AfrV«) 

5.2 4 = 16 
Section 3.3 

La <=> e, d <=* /, g <=> h 

3. No. In symbolic form the question is: Is (p -> q) «=* (q -» p)? 

P 9 P ^ <? <? ^ P (P "» 9) <-» (9 "» P) 









1 


1 


1 





1 


1 








1 








1 





1 


1 


1 


1 


1 



This table indicates that an implication is not always equivalent to its converse. 
5. Let x be any proposition generated by p and q. The truth table for x has 4 rows 
and there are 2 choices for a truth value for x for each row, so there are 

2 • 2 • 2 • 2 = 2 4 possible propositions . 
(See Table 13.6.1 for an illustration.) 
7 . -» p and p -» 1 are tautologies . 

Section 3.4 

1 . Let s = "I will study" , t = "I will learn." The argument is: ((s -» t) A (-1 1)) -» (-1 .s) , call the argument a. 
.s / .s -» f (.s -» r) A (-i r) a 









1 


1 


1 





1 


1 





1 


1 











1 


1 


1 


1 





1 



Since a is a tautology, the argument is valid. 

3. In any true statement S, replace; A with V, Vwith A,0 with 1, 1 withO, «= with =>,and => with <=. Leave all other connectives unchanged. 
5 . (a) If not EOF then repeat 

Read (ch); 

Count := Count + 1 

Until EOF 
Law used: involution law, Not Not EOF<» EOF} 
(b) S:=0; K := 1; N := 100; 

If7<:<=A^thendo 

Repeat 
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S:=S + K; 
K:=K + \ 
Until K> N 

No Law of logic is really used here, only a law of integers: 
Not (K<N)<?>K>N. 
Section 3.5 



l.(a) 



p 


1 


(p V q) A - q 


dp V q) A - <?) -» p 











1 





1 





1 


1 





1 


1 


1 


1 





1 



P 


q 


(p^q)A^q ^p (p^q)/\(^q) 




(b) 

1 

1 




l 



1 


l l l 
1 1 
1 
1 



3. (a) Direct proof: 

(1) rf-»(aVc) 

(2) d 

(3) aWc 

(4) a^b 

(5) iaV* 

(6) c->6 

(7) -cVfc 

(8) (-. a V*) A ("■ c \/b) 

(9) (-i a A -. c) V b 

(10) i(aVc)V* 

(11) /•■ 

Indirect proof: 

(1) -■*> 

(2) a^fo 

(3) -a 

(4) c^fc 

(5) -.c 

(6) (-.jA-ic) 

(7) -n (a V c) 

(8) rf->(aVc) 

(9) id 

(10) d 

(11) 
(b) Direct proof: 

(i) (p-»«) A (»■-»*) 



Negated conclusion 

Premise 

Indirect Reasoning (1), (2) 

Premise 

Indirect Reasoning (1), (4) 

Conjunctive (3), (5) 

DeMorgan's law (6) 

Premise 

Indirect Reasoning (7), (8) 

Premise 

(9),(10)« 
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(2) p -» q 

(3) (p^t)A(s^u) 

(4) 9 -»f 

(5) p^t 

(6) r^.s 

(7) s -> u 

(8) r->K 

(9) p -> r 

(10) p -» « 

(11) p -» (? A u) Use (n^AfuzJwu^A z) 

(12) -i(r A «)-»-./» 

(13) -.(rA«) 



(14) -, p ■ 
Indirect proof: 

(1) p 

(2) p -» q 

(3) « 

(4) 9 -» t 

(5) ? 

(6) i(fAn) 

(7) -. f V -> u 

(8) -.« 

(9) .s -» u 

(10) -.9 
(ll)r-»j 

(12) -T 

(13) p->r 

(14) r 

(15) ()■ 
(c) Direct proof: 

(1) -iVp 

(2) s 

(3) - (-, .9) 

(4) p 

(5) p -» (q -» r) 

(6) <j^r 



Premise 

Added premise (conditional conclusion) 

Involution (2) 

Disjunctive simplification (1), (3) 

Premise 

Detachment (4), (5) 
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(7) q 

(8) r 
Indirect proof: 

(1) - (s -» r) 

(2) -.(-njVr) 

(3) jAnr 

(4) .9 

(5) -uV/» 

(6) .s -» p 

(7) p 

(8) p->(<?->r) 

(9) g^r 

(10) 9 

(11) r 

(12) -r 

(13) 
(d) Direct proof: 

(1) P^<7 

(2) <?^r 

(3) p->r 

(4) pV'' 

(5) -ypMr 

(6) (pV'-)A(-pVr) 

(7) (p A - p) V r 

(8) V r 

(9) /■■ 
Indirect proof: 

(1) -T 

(2) p V r 

(3) P 

(4) p^g 

(5) « 

(6) q^r 

(7) r 

(8) 



Premise 
Detachment (6), (7) I 



Negated conclusion 
Conditional equivalence (I) 
DeMorgan (2) 
Conjunctive simplification (3) 

Premise 

Conditional equivalence (5) 

Detachment (4), (6) 

Premise 

Detachment (7), (8) 

Premise 

Detachment (9), (10) 

Conjunctive simplification (3) 

Conjunction (11), (12) ■ 



Negated conclusion 

Premise 

(1),(2) 

Premise 

Detachment (3), (4) 

Premise 

Detachment (5), (6) 

(1),(7)B 



5. (a) Let W stand for "wages will increase," / stand for "there will be inflation," and C stand for "cost of living will increase." Therefore the 
argument is: W -» /, -i / -» -< C, W => C The argument is invalid. The easiest way to see this is through a truth table. Let x be the conjunc- 
tion of all premises . 
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W 



c -./ 



<C W->I -./-»-.C 












1 


1 


1 














1 


1 





1 


1 








1 








1 


1 


1 








1 


1 








1 


1 





1 








1 


1 











1 





1 


1 








1 





1 


1 








1 


1 


1 


1 


1 


1 


1 








1 


1 


1 



(b) Let r stand for "the races are fixed," c stand for "casinos are crooked," t stand for "the tourist trade will decline," and p stand for "the police 
will be happy." Therefore, the argument is: 

(rV c) -» t, t -» p, -i p -» -i r. The argument is valid. Proof: 



(1) t^p 


Premise 


(2) -,p 


Premise 


(3) -if Indirect Reasoning (1), (2) 


(4) (rVc)^r 


Premise 


(5) i(rVc) 


Indirect Reasoning (3), (4) 


(6) (-, r) A (-■ c) 


DeMorgan (5) 


(7) -T 


Conjunction simplification (6) 



1 . P\ -> Pk an d Pk ~* Pk+i implies p\ -» p^ +1 . It takes two steps to get to pi -» p k+l from pi -» p k This means it takes 2 (100 - 1) steps to get to 
Pi -> Pioo (subtract 1 because pi -» pi is stated as a premise). A final step is needed to apply detachment to imply pioo 

Section 3.6 

1. (a) {{1), (3), {1,3),0} 

(b) {{3), {3,4), {3,2), {2,3,4)) 

(c) {{1), {1,2), {1,3), {1,4), {1,2,3), {1,2,4), {1,3,4), {1,2,3,4)) 

(d) {{2), {3), {4), {2,3), {2,4), {3,4)) 

(e) {A c U : \A\ = 2) 

3. There are 2 3 = 8 subsets of U, allowing for the possibility of 2 8 nonequivalent 
propositions over U. 

5. sis odd and (s - 1) {s - 3) (s - 5) (s - 7) = 
7. b and c 

Section 3.7 

1 . We wish to prove that P(n) : 1 + 3 + 5 + • • • + (2 - n) = re is true for n s 1 . Note: The nth odd positive integer is 2n - 1 . 

Basis: for n = 1 : 1 = l 2 

Induction: Assume that for somen >; 1 , p(n) is true. Then: 

1+3 + — +(2(n + l)-l) + l = [l+3 + — +(2n-l)] 
+(2(n+l)-l) 

= n 2 + (2 n + 1) by p(n) and basic algebra 



3. Proof: 



= (i, + l) 2 
(a) Basis: 



1 = 1(2)(3)/6 = 1 
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(b) Induction: £ £ 2 = ££ 2 + (« + I) 2 

1 1 

n(n+l)(2n+l) 



6 
(n+l)(2n 2 +7/i+6) 

6 

_ (n+l)(n+2)(2n+3) 
6 

5. Basis: For n = 1 , we observe that 



+ (n + If 



(1-2) (i+i) 

Induction: Assume that for some nsl, the formula is true. 

Then: — + ••• + 

(1-2) 



1 


n 1 


((B+l)(B+2)) 


(b+1) ((b+1)(b+2)) 








((n+l)(«+2)) 




(n+1) 2 




((«+!) (n+2)) 




- <2±ii a 

(n+2) 



7. Let A„ be the set of strings of zeros and ones of length n (we assume that \A n \ = 2" is known), E„ = the even strings, and E c n = the odd strings. 

The problem is to prove that for n =s 1, \E n \ = 2" _1 . Clearly, |£'| = 1, and, if for some n » 1, |£„| = 2"- 1 , it follows that \E n+1 \ = 2" by the 
following reasoning: 

£„+i =(l.s :,rrn££} U (0.s: sin£:„} 

Since {Is: s'mE c n ) and {0.s : smE n ) are disjoint, we can apply the addition law. Therefore, \E n +\\ = \E c n \ + \E n \ 

= 2"-' +(2"-2"- 1 ) = 2 n . ■ 

9. Assume that for n persons (n =; 1), — — - handshakes take place. If one more person enters the room, he or she will shake hands with n 

, (n-l)n (n 2 -n+2n) n(n+l) 

people, +n= = 

r r > 2 2 2 

_ ((.i+l)-l)(n + l) 

2 

Also, for n = 1 , there are no handshakes: =0. ■ 

2 

1 1 . Let p(n) be "ai + 02 + • • • + a„ has the same value no matter how it is evaluated." 

Basis: a^ + «2 + 03 may be evaluated only two ways. Since + is associative, (a\ + ai) + 03 = a\ + (02 + 03). Hence p (3) is true. 

Induction: Assume that for some n & 3 p(3), p(4), . . . , p(«)are all true. Now consider the sum a\ + 02 + ••• + a n +\. Any of the n 
additions in this expression can be applied last. If the 7th addition is applied last, we have c t ■ = [a^ + a^ + ■■■ + a^ + Uij+i + ••• + a„+i). No 

matter how the expression to the left and right of the f b addition are evaluated, the result will always be the same by the induction hypothesis, 
specifically p(j) and p(n + 1 - j). We now can prove that c\ = Ci = ■■■ = c„. If i < j, 

ct = (ai + a% + ■■• + at) + ((a,+ i + •■■ + aA + (a,+ i + ••• + a n +u 

= {a\ + ai + ■■■ + a,) + f(a; + i + ••• + aA + (a J+ i + ••• + a„ + i) 

= ((a, + ■■■ +a,) + (a, + 1 + ••• + a,)) + (a J+] + ••• +a„+i) byp(3) 
= c ; 
c, = (ai + 02 + •" + a /) + ( a i+i + """ + a j + a j+i + •" + a n+i) definition of c, 

= (ai + 02 + •" + a i) + (( a i+i + •" + a j) + ( a y+i + ■■■ + a n+ ij by p(n + 1 — i) 
= ((a, + ••• +a,) + (a, + 1 + ••• + a;)) + (a,- +] + ••• +a„+i) byp(3) 

= (ai + ••• + a; + a,-+i + ••• + ay) + (a ;+ i + • • • + a„ + i) by p(i) 

= Cj definition of Cj ■ 

13. For m s 1 , let p(m) be x n+m = x" x m for all n s 1 . The basis for this proof follows directly from the basis for the definition of exponentiation. 
Induction: Assume that for some m =s 1 , p(m) is true. Then 
x n+(m+i) _ x (n+m)+i by associativity of integer addition 
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_ x n+m x i by recursive definition 

= x" x m x 1 inductive hypothesis 

_ x n x m+ 1 recursive definition ■ 

Section 3.8 

l.(a)(Vi)(F(i)^G(i» 

(b) There are objects in the sea which are not fish. 

Every fish lives in the sea. 

3. (a) There is a book with a cover that is not blue. 

(b) Every mathematics book that is published in the United States has a blue cover. 

(c) There exists a mathematics book with a cover that is not blue. 

(d) There exists a book that appears in the bibliography of every mathematics book. 

(e) (Vjc)(B(jc)-»M(jc)) 

(f) (3x)(.M(x)A-<U(x)) 

(g) (3*)((Vy)(-.*(j(,y)) 

5. The equation 4 u 2 - 9 = has a solution in the integers. (False) 

7. (a) Every subset of U has a cardinality different from its complement. (True) 

(b) There is a pair of disjoint subsets of U both having cardinality 5 . (False) 

(c) A - B = B c - A c is a tautology. (True) 
9.(V ci)q (V &)q(q + b is a rational number.) 
11. Let /={1,2,3, ...,n) 

(a) (3 ;)/(*£ A,) 

(b) (V ;)/(* e A,) 
Section 3.9 

1 . The given statement can be written in if . . . , then . . . format as: If x and y are two odd positive integers, then x + y is an even integer. 

Proof: Assume x and y are two positive odd integers. It can be shown that x + y = 2 -(some positive integer). 

x odd => x = 2 m + 1 for some iiieP, 

y odd => y = 2 n + 1 for somen e P. 

Therefore, x + y = (2m + \) + (2n+l) = 2 ((m + n) + 1) = 2-(some positive integer) so x + y is even. ■ 

3. Proof: (Indirect) Assume to the contrary, that yl is a rational number. Then there exists p, q e Z, (q + 0) where - = V2 and where 

- is in lowest terms , that is , p and q have no common factor other than 1 . 
q 

- = V 2 =>— = 2=>p 2 =2g 2 => p 2 is an even integer => p is an even integer (see Exercise 2) 4 is a factor of p 2 => q 2 => is even => g is even. 
Hence both p and q have a common factor, namely 2. Contradiction. ■ 

5. Proof: (Indirect) Assume ijeR and x + yssl. Assume to the contrary that (x =s -ory^ -) is false, which is equivalent to 

x > - and y > - . Hence x + y>-+- = l. This contradicts the assumption that x + y ^ 1 . ■ 

Supplementary Exercises— Chapter 3 
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p 

l-(a) 
1 

P 
(c)0~ 
1 



pVp 



-•p 



p 

(b) 
1 



pV-p 



1 

o 

p 

(d)0 
1 



pA-p 



o 



pAp 



p 


q 


-•p 


ql\^p 


P V (? A - p) 








1 











1 


1 


l 


l 


1 











l 


1 


1 








l 



5 . Let a ■■ 



-<q, b = q\fr, and c = (p -» -> #) A (q V A -"" 



p 


<? 


r 


-"? 


a 


b 


a l\b 


-i r 


c 


->p 


C -» -i p 











1 










1 





1 










1 


1 




1 


1 








1 







1 










1 


1 


1 


1 


1 







1 


1 







1 


1 








1 




1 








1 










1 










1 





1 


1 




1 


1 













1 


1 











1 





1 










1 


1 


1 








1 

















1 . An implication is always equivalent to its contrapositive, as can be seen in the table below. 



p 


q 


p^q 


-iq=>-,p 


(p^>q)<^(^q^>^p) 








1 


1 







l 


1 


1 




1 













1 


1 


1 


1 





9. The truth tables of p -» (p A q) and x must be equal. 



p 


1 


p -> (p A q) 


X 








l 


1 





1 


l 


1 


1 











1 


1 


l 


1 



1 1 . (a) "3 is not a prime number or it is odd," or "3 is a composite number or it is odd." 

(b) "4 is not a prime number and it is even," or "4 is a composite number and it is even.' 

(c) I can exhibit an example of a statement and I cannot prove it. 

(d) x 2 -lx+ 12 = 0andx*3andjc* 8 



13. (a) 



p 


q 


p^q 


""7 -> - 1 P 


(p^q)<^(-,q^^p) 








l 


l 







l 


l 


l 




1 













1 


1 


l 


l 
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p 


<7 


p^q 


p^q 


q^p 


(p->q) A(q-> p) 








1 


1 


1 


l 





1 





1 








1 











1 





1 


1 


1 


1 


1 


l 



(b) 



Columns 3 and 6 are the same so (p <-> q) <=> [(p -> g) /\ (q -» p)J is a tautology. 
15. Let <? = "I quit my job," .s = "will starve," and w = "I did my work." In 

symbolic form, the argument is (q -» .s) A C -1 vf -» <j) A vc => -> .s. The truth 

table for x=(g^> s)A(tiii-)^Aw^ts does not consist of all ones . 
For example, when q is false, s is true and when w is true, x is false. Therefore, 
x is not a tautology and the argument is not valid. 

17. [(m->p) A(e\f-*p) A^e] =>->m. Valid 

Proof: (Direct) 

(1) e V ~< p Premise 

(2) -i e Premise 

(3) -i p (1), (2), disjunctive simplification 

(4) m -> p Premise 

(5) -im (3), (4), indirect reasoning ■ 

19. [(-< p -> - q A - r A (p -> s) A (9 V r) => s. Valid 

Proof: (Direct) 



(1) 


->p-*->q 


Premise 


(2) 


?-» P 


(1), Contrapositive 


(3) 


p -> s 


Premise 


(4) 


q -> s 


(2), (3), Chain rule 


(5) 


q\Jr 


Premise 


(6) 


-i r 


Premise 


(7) 


1 


(5), (6), Disjunctive simplification 


(8) 


s 


(4), (7), Detachment ■ 


(Indirect) 




(1) 


-i s 


Negated conclusion 


(2) 


p -» s 


Premise 


(3) 


~T 


(1), (2), Indirect Reasoning 


(4) 


-<p->-<q 


Premise 


(5) 


~"1 


(3), (4), Detachment 


(6) 


q\/r 


Premise 


(7) 


r 


(5), (6), Disjunctive simplification 


(8) 


-i r 


Premise 


(9) 





(7), (8) ■ 



21. 



e -» ;', i -» d, d -» w => e -» w 
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Proof: 

(1) e -» i Premise 

(2) i -> <i Premise 

(3) e^d (1), (2), Chain rule 

(4) d -> w Premise 

(5) e -» w (3), (4), Chain rule ■ 

23. Valid. Statement: f V <*. "■ « V j, "" 7 V >"=>-.?-> r 
Proof: (direct) 

(1) f V <i Premise 

(2) -i t -* d (1), Conditional equivalence 

(3) -i d V y Premise 

(4) rf -» 7 (3), Conditional equivalence 

(5) -i j V r Premise 

(6) y -» r (5), Conditional equivalence 

(7) -.f-»r (2), (4), (6), Chain rule ■ 
25. (1) First show T phq c T p f] T q 

a e T p i\q => a makes p f\q true 

=> a makes p true and a makes g true 
=> a e T p and a e r„ 
=> a e T p Pi r, 
(2) To prove 7\, f] T„ C 7^/^ reverse the above steps . ■ 

27.60 = 6-10 = 2-3-2-5=2 2 -3-5 
120 = 2-60 = 2 3 -3-5 

I n \ { n 
29 (a) , , + 



k-1) \k) (n- (*-!))! (*-!)! (n -*)!*! 



(n-k+l)l(_k-l) (n-k)\k\ 
nlk + nl (n-k+ 1) 
(/>-*+ 1)1 tl 

n!(n+l) <«+!)! 



(n-k+l)\k\ (n+\-k)\k\ 

(b) Basis: (n = 1) : (x + y) 1 = x + y . 
11 / 1 



fc=0 v K ; 



x ' ^^io P + l 1 ' v= v+ v 



Induction: Assume n =; 1 and (x + j)' ! =21, \x n y 

k=o^ k 



We will now prove (x + y)" +[ = JJ+o t -^'^ /• 



' n + 1 
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( x + y )" +l = (jc + y) (x + yf 



= (x + y) XJ! =0 \x n k y k By induction hypothesis 

= x 5 =0 ( " J x"- k y k + y 5 =0 [ " J x"- k y k distribution 

= I , \x n+, - k y k + £ , U"~V + 1 
k=0 \ k I k=0 \ k > 

Let k = k - 1 in the second summation; remember to increase top. 

- V" I n I v" +1 -* 1)* 4- V" + 1 I " I v ll-*+l ,,fc . 

;)^ + J 5[(*) + (*-i)]^ /+ (: 1 ^ 

o)^^i( n r)^ +i -^ + (;)^ 



But i ;; i -i " ; ' \ - umdi "U(" + i - 1 

n ) \ n + 1 



■(:)■("')■ 

"±i/n+ 1 \ ^, . , 
fc=0 ^ ' 

3 1 . - (3 x) ((V j) (D(x) A T(y))) => (V *) (-. (V y) (£>« A ?Xy))) 
=>(V*)((3y)(-.(D(*)AlXj')))) 
=>(V*)((3y)(-.D(*)V^7Xy))) 
=> All sailing is not dangerous or some fishing is 
not tedious . 

33. (a) Let U be the universe of all fish, k(x) ="x is kind to children," and s(x) ="x is a shark." (V x) u (i s(x) -» fc(;t)) 

(b) Let vv(jc) = "jc is a wine drinker," 

c(x) ="x is very communicative," 
p(x) = "x is a pawnbroker," 
and h(x) = "x is honest"; 

then (V x) ((3 y) ((tv« -» c(x)) V (p(y) -» (AW A - My))))) 

(c) Let p(x) = "x is a clever philosopher," 

c(x) = "x is a cynic," 
and w(x) = "x is a woman"; then 

(V x) (0 y) (((p(x) -» c(x)) A (w(x) -» p(x))) -» (p(y) -» ( W (y) -» c(y))))) 
35. (V a) R + (V b) R + (3 n) P (na > b) 

CHAPTER 4 

Section 4.1 

1. (a) Assume thatx e A (condition of the conditional conclusion A c C). Since A c B, x e B by the definition of c. fie Cand ieD implies 
that ieC Therefore, if x e A, then x e C. ■ 

(b) (Proof that A -B c A f| B c ) Let x be in A - B. Therefore, x is in A, but it is not in B; that is, x e A and x e B c => x e A fl # r - ■ 

(c) ( => ) Assume that A c B and A c C. Let jc e A. By the two premises, x e B and jc e C. Therefore, by the definition of 

intersection, x e B f]C.M 

(d) (=>) (Indirect) Assume that AcC and B c is not a subset of A c . Therefore, there exists iefi r that does not belong to A c . 
x £ A c => x e A. Therefore, x e A andx <£ B, a contradiction to the assumption that A c B. ■ 

3. (a) If A = Z and B = 0, A - B = 2, while B - A = 0. 

(b) IfA = {0)andB = (1),(0, 1) e A xB,but(0, 1) is not inBxA. 
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(c)LetA = 0,fi = {0},andC = {1). 
5 . Proof: Let p (n) be 

AC](B l \JB 2 \J---\JB„) 

= (Ar\B l )U(Ar\B 2 )u---U(Ar\B„). 

Basis: We must show that p (2) : A f] (B t \J B 2 ) = (Af]Bi) \J (A [~| #2) is true. This was done by several methods in section 4.1 . 
Induction: Assume for some n > 2 that p(n) is true. Then 

a n (s, u s 2 u ■ ■ ■ u B„ + ,) = a n t(s, u s 2 u ••• u s„) u s„ +1 ] 
= (a n (5i u b 2 u ••• u s»)) u (a n s„ + o 

by/7(2) 

= ((a n«i) u - u (A n b„)) u (a n s„ +1 ) 

by the induction hypothesis 

= (Ans,)u---U(Ans„)U(Ans„ + i) ■ 

Section 4.2 

l.(a) 






\ 


H 




.1 






B 


t' 


















1*'- Bl 'C 


1 - retiipnt «t (Ik umw Ml 


(b) A 





A 1 ' 


B c 


A{jB 


(a u by a c n S c 





1 


1 





1 1 





1 


1 





1 





1 








1 


1 





1 


1 








1 






The last two columns are the same so the two sets must be equal. 

(c) (i)x e A U A => x e A or x e A by the definition of fl 
=> x e A by the idempotent law of logic 
Therefore.AlJAcA. 
(ii) x e A => x e A and x e A by conjunctive addition 

=> ie A IJ A 
Therefore, A c A IJ A and so we have A IJ A = A. ■ 

3. For all parts of this exercise, a reason should be supplied for each step. We have supplied reasons for part a only and 
left them out of the other parts to give you further practice. 

(a) A J (B - A) = A J (B f| A') by Exercise 2b of Section 4.1 

= (A J B) Pi (A J A c ) by the distributive law 

= (A IJ B) Pi U by the null law 

= (A J B) by the identity law ■ 

(b) A - B = A |J B ' 
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= B c n cat 

= B C -A c 

(c) Select any element, x, in A f] C. One such element exists since A f] C is not empty. 
xeAC\C=$xeA and x eC 
=> x e B and xe C 

^xeBf]C 
^Bf)C±0 m 
(d)Af](.B-C)= ADiBDC) 

= (a n b n a c ) u (a n b n c c ) 
= (a n b) n (a c u c c ) 
= (a n b) n (a u cr 

= (A-B)n(A-C) ■ 

(e)A-(BUC) = Afl(BUC) c 
= Afl(B c nC r ) 

= (a n b c ) n (a n c c ) 

= (A-B)n(A-C) ■ 

3 12 13 2 2 3 1 

5.(a)AUB c nC (b)AflfiUCns (c) A U B U C c 

Section 4.3 

1. (a)(1), (2, 3, 4, 5), (6), (7, 8), (9, 10} 

(b) 2 5 , as compared with 2 10 . { 1 , 2) is one of the 992 sets that can't be generated. 

3. B { = (00, 01, 10, 11) andB 2 = 10, 00, 01) generate minsets (00, 01), (0), (10, 11), and (A, 1). Note: A is the null string, which has 
length zero. 

5.(a)B,nB 2 = 

B, n B c 2 = (0, 2, 4) 

B c inB2 = (l,5| 

B C [ C\B C 1 = {3\ 

(b) 2 3 , since there are 3 nonempty minsets . 

7. Let ae A. For each i, a e B;, or a e B, r , since B; IJ Bf = A by the complement law. Let D, = B, if a e B,, and D = B, c otherwise. Since a is in 
each D h it must be in the minset D t f] D 2 ■■■ f] D„. Now consider two different minsets M\ = D\ f] D 2 ■■■ f] D„, and M 2 = G\ f] G 2 ■■■ f] G„, 
where each £), and G, is either B, or B; c . Since these minsets are not equal, D, + G,, for some i. Therefore, 
M\ Pi A/2 = D\ p| D2 • • • Pi On Pi Gi P| G2 • • • Pi G„ = 0, since two of the sets in the intersection are disjoint. Since every element of A is in a 
minset and the minsets are disjoint, the nonempty minsets must form a partition of A. ■ 

Section 4.4 

l.(a)An(BUA)=A 

(b) A n ((B r n A) U Bf = 

(c) (A n B c f U B = A c U B 

3. (a) (p A "'(-'« Ap)V*)) »0 

(b) (-■ (p V (- 9)) A q) <=> ((- p) A q) 
5. The maxsets are: 

BiUB 2 = {l,2,3,5} 
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B, 


UB/ = 


= {1, 3,4, 


5,6) 


B] 


r UB 2 = 


= {1,2,3, 


4,6) 


Bi 


r UB 2 c 


= {2,4,5 


, 61 



They do not form a partition of A since it is not true that the intersection of any two of them is empty. A set is said to be in maxset normal form 
when it is expressed as the intersection of distinct nonempty maxsets or it is the universal set U. 

Supplementary Exercises— Chapter 4 

1 . (a) Proof: (=>) (Indirect) Assume Ac B and A \J (U - B) =£ 0. To prove that this cannot occur, let x e A f) {U - B). 

xeAf](U-B) 

Definition of complement => xeAf) B c 

Definition of Pi =>ieA and x e B c 

Definition of complement => x e A and x £ B 

Definition of subset => A is not a subset of B 

This contradicts the premise that A c B. Hence this part of the statement is proven. 

(<=) (Indirect) Assume Af](U-B) = 0, and A is not a subset of B. To prove that this cannot occur, let x e A such 
that x <£ B. 

x e A and x £ B 

Definition of complement => x e A and x e B c 

Definition of f] => x e A f] B c 

Definition of complement => x e Af) (U - B) 

Definition of disjoint ^> A{~\(U - B) ± 

But this cannot happen because it contradicts the assumption that Af)(U - B) = 0. Hence this part of the statement is proven and the proof is 
complete. 

(b) Proof: (Indirect) Assume U = A(JB,Af)B = 0, and A±U - B. One way in which A and U - B can be not 
equal is that A is not a subset of U - B. Let x e A and x$ U - B. 

x e A and x£ U - B 

Definition of complement =>ieA and isfi 

Definition of H => xcAf] B 

Definition of disjoint => Af]B =£ 

But this cannot happen because it contradicts the assumption that A[) B = 0. The other way A and U - B can diffe is if U - B is not a subset 
of A, Let x £ A and x e U - B. We could infer from this assumption thatx x £ A {J B. Therefore, any way that we assume that A + U - B leads 
to a contradiction. 

(c) Proof: (=>) (Direct) Let x e A. 
A and B are disjoint 
Definition of disjoint => x £ B 
Definition of complement => x e B c 

Therefore, A qB c 

(<=) (Indirect) Assume that Ac B c and x e Af]B. 

xc-AOB 

Definition of intersection => x e A and x c B 

Definition of complement => x e A and x $ B c 

Definition of subset => A is not a subset of B c ■ 

3. (a) Proof: (Direct) Let A, B, and C be sets. 

Lett*, y) c(A[JB)xC. 

Definition of Cartesian product => x e (A IJ B) and y e C 

Definition of IJ => (x e A orx e B) and y e C 

Distributive law of logic => (x e A and j e C) or (x e £ and y <= C) 

Definition of Cartesian product => ((x, y) e A x C) or 

((x, j)eBxC) 

Definition of U => (x, y) e (A x C) U (5 x C) ■ 

(b) We proved (A U B) x C c (A x C) U (B x C) in part a; we now must show (AxC)\J (BxC) c(A{J B)xC and we will be finished. 

5. Proof: (Indirect) Assume A, B, and C are subsets of U, A c B, B c C and C c is not a subset of A c . To prove that this cannot occur, let x e C c 
and x £ A r by definition of subset. 
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ieC and x £ A c 

Definition of complement 

Premise 

Definition of subset 

Premise 

Definition of subset 



> x g C and x e A 
. AcB 

• BcC 

• x e C (Contradiction) 



7. (a) Proof: (Indirect) Let A, B, and C be sets. Assume A \J C =/= B \J C and A = B. 



A = B^>AcB 
x<=A\JC 
Definition of union 
Definition of subset 
Definition of union 



• x e A or x e C 

■ieBokeC 
<xeB(JC 

Therefore, AlJCcBlJC. Bya s i ml l ar ^ ne of reasoning we can infer B {J C c A {J C, which proves that 
A(J C = B (J C, a contradiction. 

(b) Proof: (Direct) Assume A =/= B and show A c t B c . Since A =£ B we can assume that A is not a subset of B. The 
alternative is that B is not a subset of A and the remaining logic would be identical. 

A not a subset of B 

Definition of subset => x e A and x £ B 

Definition of complement => x £ A r and x e B c 

Definition of subset => B r is not a subset of A c 

Definition of inequality => A c + B c ■ 

9. (a) The minsets are B v fl #2 = (3), Bi c fl #2 = (2, 5), B, [~}B 2 C = {\), and B^ fl # 2 C = I 4 ' 6 1 

(b) The minsets are disjoint and 

(Si n b 2 ) u (Bi c n b 2 ) u (Bi n b 2 c ) u (Bi c n b 2 c ) = u, 

so the minsets form a partition of U. 

CHAPTER 5 

Sections 5.1-5.3 

1 . For parts c, d and i of this exercise, only a verification is needed. Here, we supply the result that will appear on both sides of the equality. 



(a) AB = 


(1 


6 

-13 


] BA = 


/ 2 3 
1-7 -18 


) 




(3 \ 
(C) (l5 -6j 




(d) 


/ 18 -15 
{ -39 35 


15 

-35 


) 


(f) B + = B 




(g) 


lo oj 




»(S 


3 f 1/2 ° ) 
I 1/3 J 












( 1 ° 


> 




f ! 




\ 


5. A 3 = 


8 





A L 




32768 









(0 


27 J 






^0 


14348 907 J 



(b) 
(e) 
(i) 



1 

5 -2 

-12 
5 

5 -5 
10 15 



5 -5 
-25 25 



7. (a) Ax = I , , equals , I if and only if both of the equalities 

1 xi - 1 x 2 ) \ 1 ' 



2 Xi + ^2 = 3 and x\ - x 2 = 1 are true. 

1 1 j 1x2. 



(b) (i) 





fill 


* 




( Xy \ 




1 \ 


(ii) A = 


1 2 -1 


x = 


X2 


B = 


-1 




ll 3 1 J 




v*3 J 




5 J 


t 1 1 i 




fxi •• 




f 3 1 


(iii) A = 


1 


x = 


X2 


B = 


5 




,10 3; 




1^3 y 




U J 


Section 5.4 
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l.(a) 



(d)A- 



-1/5 3/5 
2/5 -1/5 

. (e) 



(b) 



1 3 
1 



r 1/3 o 

2 






-1/5 J 



(c) No inverse exists. 



3. Let A and B be n by n invertible matrices. Prove (AB) ' = B ' A ' . 

Proof: (B" 1 A" 1 ) (Afi) = (fi- 1 ) (A-'(AB)) 

= (B- [ )((A- l A)B)) 
= (B" 1 )(/B) 
= (B" 1 )(B) 
= / 

Similarly, (AB) (B" 1 A" 1 ) = /. 

By Theorem 5.4.1, B" 1 A -1 is the only inverse of AB, If we tried to invert AB with A -1 B _1 , we would be 
unsuccessful since we cannot rearrange the order of the matrices. 

5 . (b) 1 = det / = det(AA~ ' ) = det A det A~ ' . Now solve for det A" 1 . 

7. BasiS: (n= 1): det A 1 =detA = (detA)>. 

Induction: Assume det A" = (det A)" for some n > 1. 

det A" + ' = det(A' ! A) by the definition of exponents 

= det(A") det(A) by exercise 5 

= (det A)" (det A) by the induction hypothesis 
= (detA)" +1 

9. (a) Assume A = BOB" 1 
Basis(;» 



1): A A 1 =BD [ B 1 is given. 
Induction: Assume that for some positive integer m, A" 1 



-. BD'" B~ 



(b) A" 



: BD W B 



A m A 

(BD m B- [ ) (BDB~') by the induction hypothesis 

BD"' DB l by associativity, definition of inverse 

BD m+t B -i 

9206 15 345 
6138 10231 



Section 5.5 



1. (1) Let A and B be m by n matrices. Then A + B = B + A, 

(2) Let A, B, and C be m by n matrices. Then A + (B + C) = (A + B) + C. 

(3) Let A and B be m by n matrices, and let c e R. Then c(A + B) = cA + cB, 

(4) Let Abe anmby n matrix, and let ci, c 2 e R. Then {c\ + c 2 )A = c t A + c 2 A. 

(5) Let A be an m by n matrix, and let Ci, c 2 6 R. Then Cife A) = (C| c 2 ) A 

(6) Let be the zero matrix, of size m by n, and let A be a matrix of size n by r. Then A = = the m by r zero matrix. 

(7) Let A be an m by n matrix, and = the number zero. Then A = = the m by n zero matrix. 

(8) Let A be an m by n matrix, and let be the m by n zero matrix. Then A + = A. 

(9) Let A be an m by n matrix. Then A + (-l)A = 0, where is the m by n zero matrix. 

(10) Let A, B, and C be m by n, n by r, and n by r matrices respectively. Then A(B + C) = AB + AC. 

(11) Let A,B, and C be m by n, r by m, and r by m matrices respectively. Then (B + C) A = BA + CA. 

(12) Let A, B, and Cbembyn,nby r, and rby p matrices respectively. Then A(BC) = (AB) C. 

(13) Let A be an m by n matrix, /„, the m by m identity matrix, and /„ the n by n identity matrix. Then I m A = A/„ = A 

(14) Let Abe an nby n matrix. Then if A~' exists, (A~')~' = A. 

(15) Let A and B be nby n matrices. Then if A" 1 and B l exist, (ABy 1 = B~ [ A" 1 . 

I 21 5 22 
3.(a)AB + AC = U -6 
(b) A(B + C) = AB + AC 



(c) A- 1 = 
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(d) (A 2 y [ = (AA)- ] = (AA' 1 ) '=/-'=/ by part c 
Section 5.6 

1. In elementary algebra (the algebra of real numbers), each of the given oddities does not exist. 

(i) AB may be different from BA . Not so in elementary algebra, since ab = ba by the commutative law of multiplication. 

(ii) There exist matrices A and B such that AB - 0, yet A + and B ± 0. In elementary algebra, the only way ab = is if either a or b is zero. 

There are no exceptions . 

(iii) There exist matrices A, A 4= 0, yet A 2 = 0. In elementary algebra, a 2 = <=> a = 0. 

(iv) There exist matrices A 2 = A. where A =/= and At I. In elementary algebra, a 2 =a<=>a = 0otl. 

(v) There exist matrices A where A 2 = I but A 1 1 and A t -I. In elementary algebra, a 2 = l<=>a = lor-l. 

3 . (a) det A + => A" ' exists , and if you multiply the equation A 2 = A on both sides by A" ' , you obtain A = /. 

/ 1 
(b) Counterexample: A = I „ 

, / 1/3 1/3 \ 
5.(a)A- 1 =( 1 J 3 _ 2 ' J x 1= 4/3,andx 2 = l/3 

(b)A-'=([ ~M *i=4,andx 2 =4 

, / 1/3 1/3 \ 
(C) (l/3 -2/3 J -*i= 2 / 3 > and *2 = - 1 / 3 

, / 1/3 1/3 \ 
(d)A =(l/3 -2/3) ^=°' and ^ = 1 

(e) The matrix of coefficients for this system has a zero determinant; therefore, it has no inverse. The system cannot 
be solved by this method. In fact, the system has no solution. 

Supplementary Exercises— Chapter 5 

x + y 5 \ (3 5 \ ( x + y = 3 ( y = -1/2 



" -2 x-y I \-2\)'\x-y = \'\x = lj2 
3. For n > 1 let p(«) be AB" = B" A 



BasiS:(/f = 1): AB 1 = B [ A is true as given in the statement of the problem. Therefore, p(\) is true. 

Induction: Assume n > 1 and p (n) is true. 

AB" +1 =(AB")B 

= (B n A) B By the induction hypothesis 
= (B n B)A By p (I) 

= B" +[ A m 
, , . (1 18 

7. D has no inverse if det D = 0. 
det D = <=> 3 c - /(15) = 3 c - 60 = <=> c = 20 

9. (a) (A + B) 2 = A 2 + AB + BA + B 2 

(b) (A + B) 2 = A 2 + 2 AB + B 2 only if AB = BA. 

I 1 \ /0 1 \ 

11. The implication is false. Both and are self-inverting, but their product is not. 

(a b \ , 

13. Yes, matrices of the form A = . also solve A = / 

l(l-a) 2 /^ -a J 

CHAPTER 6 

Section 6.1 

1. (a) (2, 4), (2, 8) (b) (2, 3), (2, 4), (5, 8) (c) (1, 1), (2, 4) 
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3. (a) r = {(l,2), (2,3), (3,4), (4,5)) 

(b) r 2 = {(1,3), (2,4), (3, 5)} = {(*, y) : y = x + 2, x, y e A) 

(c) r 3 = {(l,4), (2, 5)) = {(x, y) : y = x + 3, x, y e A) 
5. (a) When n = 3, there are 27 pairs in the relation. 

(b) Imagine building a pair of disjoint subsets of S. For each element of 5 there are three places that it can go: into the first set of the ordered 
pair, into the second set, or into neither set. Therefore the number of pairs in the relation is 3", by the product rule. 

Section 6.2 

1. 




i 

3. See Figure 13.1.1 of Section 13.1. 

5. A Hasse diagram cannot be used because not every set is related to itself. Also, {a} and \b) are related in both directions. 

Section 6.3 







(c) The graphs are the same if we disregard the names of the vertices . 



3. (a) (i) 



(iv) 



(vii) 



reflexive (ii) 

not symmetric 
not antisymmetric 
transitive 

not reflexive (v) 
symmetric 
antisymmetric 
transitive 

not reflexive 
not symmetric 
not antisymmetric 
not transitive 



reflexive (iii) 

not symmetric 
antisymmetric 
transitive 

reflexive (vi) 

symmetric 

not antisymmetric 

transitive 



not reflexive 

symmetric 

not antisymmetric 

transitive 

reflexive 

not symmetric 

antisymmetric 

transitive 



(b) Graphs ii and vi show partial ordering relations. Graph v is of an equivalence relation. 
5. (a) No, since for example | 1 - 1 | =0 + 2 
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(b) Yes, since \i-j\ =\j-i\ 

(c) No, since | 2 - 4 | = 2 and | 4 - 6 | = 2, but | 2 - 6 | =4*2. 



(d) 



GQSC© 
GG3CG) 



7. (b) c(0) = {0), c(l) = {1, 2, 3) = c(2) = c(3) 

(c) c(0) U c(l) = A and c(0) [~| c(l) = 

(d) Let A be any set and let r be an equivalence relation on A. Let a be any element of A. a e c(a) since r is reflexive, so each element of A is 
in some equivalence class. Therefore, the union of all equivalence classes equals A. Next we show that any two equivalence classes are either 
identical or disjoint and we are done. Let c(a) and c (b) be two equivalence classes, and assume that c(a) f] c(b) 4= 0. We want to show that 
c(a) = c(b). To show that c(a) c c(b), letx e c{a). x e c(a) => arx. Also, there exists an element, y, of A that is in the intersection of c (a) and c (b) 
by our assumption. Therefore, 

ary and bry => ary and yrb (r is symmetric) 
=> arb (transitivity of r) 
Next, 

arx and arb => xra and arb 
=> xrfc 
=> brx 
=> x e c(fc) 
Similarly, c(fc) c c{a). ■ 

9. (a) Equivalence Relation 

c(0) = {0}, c(l) = {1}, e(2) = {2, 3} = c(3), e(4) = {4, 5) = c(5), 
c(6) = {6, 7} = c(7) 

(b) Not an Equivalence Relation 

(c) Equivalence Relation 

c(0) = {0, 2, 4, 6) = c(2) = e(4) = c(6) 
c(l) = {l, 3, 5, 7} = c(3) = e(5) = c(7) 
1 1 . (b) The proof follows from the biconditional equivalence in Table 3.4.2. 

(c) Apply the chain rule. 

(d) 



I 



I 

I 



Section 6.4 




u 


4 5 6 


1 


fO ^ 


Ma) 2 


1 


3 


1 


4 


y0 1 J 



and 



D 


6 


1 


8 


4 


(0 





\ 


5 


1 








6 


K0 


1 


J 
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□ 


6 


1 


8 


1 


'0 





\ 


2 











3 


1 








4 


(0 


1 


J 



(b) 



3 . R : xry if and only if | x - y \ = 1 . 
S : xsy if and only if x is less than y. 

5 . The diagonal entries of the matrix for such a relation must be 1 . When the three entries above the diagonal are determined, the entries below 
are also determined. Therefore, the answer is 2 3 . 



7. (a) 



(b) 



□ 


1 2 


3 


4 










□ 


1 


2 


3 


4 


1 


(0 1 





"i 








i 


(\ 





1 


< 


2 


1 


1 







and 




2 





1 





1 


3 


1 





1 








3 


1 





1 





4 


v 


1 


o, 








4 


v0 


1 





1 , 


□ 


1 


2 


3 


4 














1 


'0 


1 





A 
















PQ= 2 


1 





1 



















3 





1 





1 
















4 


.0 





1 


, 
















□ 


1 


2 


3 


4 














1 


'0 


1 





A 
















P 2 = 2 


1 





1 





= 


e 2 












3 





1 





1 


















4 


.0 





1 


0; 

















9. (a) Reflexive: Ry = Ry for all i, j, therefore Ry < Ry 

Antisymmetric: Assume Ry < Sy and Sy < Ry for all 1 < ;', j < n =* Ry < Sy 

Transitive: Assume R, S, and T are matrices where Ry < Syand Sy < Ty, for all 1 < ;', j < n. Then Ry < Ty for all 1 < i, j < n, and so 
R<T. 

(b) (R )y = Rji Rij + R a R2j + • • • + Rj n Rnj 

< S u 5, j + S i2 S 2J + -+ S in S nj = (S 2 )y => R 2 < S 2 

To verify that the converse is not true we need only one example. For n = 2, let R ]2 = 1 and all other entries equal 0, and let S be the 
zero matrix. Since R 2 and S 2 are both the zero matrix, R 2 < S 2 , but since R l2 > S] 2 , R < S is false. 

(c) The matrices are defined on the same set A = {ai, a 2 , ... , a n ). Let c(ai), i = 1, 2, ... , n be the equivalence classes defined by R and let 



d(aj) be those defined by S. Claim: c(a,) c d(a,). Let cij e c(a t ) => a, ra ; - => Ry = 1 => 5y = 1 



• a ; - e d(cij) . 



Section 6.5 

3. (a) 
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G— -O 



© — 6 




© 



© 

© 




(b) Example, 1 s 4 and using S one can go from 1 to 4 using a path of length 3. 

5. (a) Definition: Reflexive Closure. Let r be a relation on A. A reflexive closure ofr is the smallest reflexive relation that contains r. 

Theorem : The reflexive closure ofr is the union ofr with {(x, x) : x e A) 

7. (a) By the definition of transitive closure, r + is the smallest relation which contains r; therefore, it is transitive. The transitive closure of r + , 
(r + ) + , is the smallest transitive relation that contains /' + . Since r + is transitive, (r + ) + = r + . 

(b) The transitive closure of a symmetric relation is symmetric, but it may not be reflexive. If one element is not related to any elements, 
then the transitive closure will not relate that element to others . 

Supplementary Exercises— Chapter 6 

1 . If Andy is the parent of Barbara and Barbara is the parent of Charles , then Andy is the grandparent of Charles . 

3. (a) r = {(-l,0),(0, 1),(1,2)} 

(b) s = {(-1, -1), (-1, 1), (0, 0), (1, -1), (1, 1), (2, 2)) 

(c) f = ((-l, 0), (-1, 1), (-1, 2), (0, -1), (0, 1), (0, 2), 

(1,-1), (1,0), (1,2), (2, -1), (2,0), (2, 1)} 

5. His main office should be at node 2. The least desirable location is at node 1. The arrows in both directions between nodes 1 and 2 represent 
a two-way street. 

7. (a) No. 

(b) Person a is friendly toward the most people so he/she would be chair person. 

(c) If "great personality" has any effect then person b becomes chairperson. 

(d) A seating arrangement does not exist, since persons c and d are only friendly toward one person each and they have to be seated 
between two people they are friendly toward. 

9. In order for the relation "living in the same house" to be an equivalence relation we must assume that a person lives in only one house. 
1 1 .(a) r is an equivalence relation. 

(b) s is neither since s is not reflexive. 

(c) In order for s to be a partial ordering we rephrase it slightly; xsy iff x taller than y or x equals y. Why would xsy iff x is the same height 
as or taller than y be wrong? 

13. There are 16 places in the adjacency matrix for a relation on four elements, but for a symmetric relation those entries below the diagonal 
will be the same as above. Hence we are only concerned with 16 - 6 = 10 places. Each of the remaining entries may take on a value of either 
or 1, so by the rule of products we have 2 10 possible symmetric relations on a four element set. 
15. 
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15. (a) 



(b) 



(c) 





(i) ((a, a), (a, b), (b, a), (b, b), (c, c)} 
(ii) {(a, a), (a, c), (c, a), (c, b), (c, c)) 







f 1 1 


\ 


(i) fl 2 = 


R = 


1 1 
,001, 




(\ 1 1 \ 


(ii) R 2 = 









{ 1 


1 1 ; 





17. 



® 




ih; 




(b) r is not reflexive, not symmetric, not antisymmetric, and not transitive. 



(c) R + . 



19. 



(\ 1 1 

10 

10 

I 1 1 1 J 

(a) A 5 is friendly to no one. 

(b) The U.S. Ambassador (Ai) should be the chairman of this committee, since he is friendly toward the most people. 

(c) The U.S. Ambassador can communicate to everyone on the committee. 



CHAPTER 7 

Section 7.1 

1. (a) Yes 



(b) Yes 



(c)No 



(d)No 



(e) Yes 



Applied Discrete Structures by Alan Doerr & Kenneth Levasseur is licensed under a Creative Commons Attribution-Noncommercial- No Derivative Works 3.0 United States 



275 275 

Solutions to Odd Numbered Exercises 



3. (a) Range of / = /(A) = {a, b, c, d) = B 

(b) Range of g = g(A) = [a, b,d) 

(c) Range of L = L(A) = {1) 

5 . For each of the | A | elements of A, there are | B | possible images, so there are | B | • \B\ -...-\B\ = | B |I A I functions from A into B. 

Section 7.2 

1 . The only one-to-one function and the only onto function is /. 

3. (a) onto but not one-to-one (/i(0) =/i(l)) 

(b) one-to-one and onto 

(c) one-to-one but not onto 

(d) onto but not one-to-one 

(e) one-to-one but not onto 

(f) one-to-one but not onto 

5. Let X = {socks selected) and Y = {pairs of socks) and define / : X -» Y where f(x) =the pair of socks that x belongs to . By the Pigeonhole 
principle, there exist two socks that were selected from the same pair. 

7. (a) f(n) = n, for example 

(b) f(n) = 1 , for example 

(c) None exist. 

(d) None exist. 

9. (a) Use s : N -+ P defined by s(x) = x + 1 . 

(b) Use the function/ : Rl -> Z defined by /(xO = x/2 if x is even and f(x) = -(x + l)/2 if x is odd. 

(c)The proof is due to Georg Cantor (1845-1918), and involves listing the rationals through a definite procedure so that none are omitted and 
duplications are avoided. In the first row list all nonnegative rationals with denominator 1, in the second all nonnegative rationals with denomina- 
tor 2 , etc . In this listing , of course , there are duplications , for example ,0/1 =0/2 = 0, 1/1 =3/3 = 1,6/4 = 9/6 = 3/2, etc. To obtain a list 
without duplications follow the arrows in the given array listing only the circled numbers. 



start 




We obtain: 0, 1, 1/2, 2, 3, 1/3, 1/4, 2/3, 3/2, 4/1, ... Each nonnegative rational appears in this list exactly once. We now must 
insert in this list the negative rationals, and follow the same scheme to obtain: 0, 1, -1, 1/2, -1/2, 2, -2, 3, -3, 1/3, -1/3, ... , which can 
be paired off with the elements of fsl . 

1 1 . Let / be any function from A into B. By the Pigeonhole principle with n = 1 , there exists an element of B that is the image of at least two 
elements of A. Therefore, / is not an injection. 

13. The proof is indirect and follows a technique called the Cantor diagonal process. Assume to the contrary that the set is countable, then the 
elements can be listed: 

tli, Hz, H3, ... where each n, is an infinite sequence of 0s and Is. Consider the array: 
n, =n n n n n l3 ■■■ 

"2 = "21 "22 "23 ••• 

n 3 = n 3l n 32 n 33 ■■■ 

We assume that this array contains all infinite sequences of 0s and Is. Consider the sequence s defined by 

if «;; = 1 

1 if/!ii=0 
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s differs from each n, in the rth position and so cannot be in the list. This is a contradiction, which completes our proof. 

Section 7.3 

1. (a) gof ; A -» C is defined by (g°f) (k) = < 

(b) No, since the domain of/ is not equal to the codomain of g. 

(c) No, since / is not surjective. 

(d) No, since g is not injective. 
3. (a) The permutations of A are ;', r\ , r 2 , / , / 2 , and fa, defined in section 15.3 

(b,c) 



+ if £ = 1 or £ = 5 
if*: = 2, 3,4 



Inverse of 


Square of 


the permutation 


permutation 


i 


i 


i'i 


l'2 


''i 


>'\ 


/. 


i 


fi 


i 


h 


i 



Permutation 
i 

r i 
i'i 
/. 
h 

h 

(d) Apply both Theorems 7.3.3 and 7.3.4: If/ and g are permutations of A, then they are both 
injections and their composition, f°g, is a injection, by Theorem 7.3.3. By 7.3.4, f°g is also a 
surjection; therefore, f°g is a bijection on A, a permutation. 

(e) Proof by induction: 

BasiS: (n = 1) The number of permutations of A is one , the identity function , and 1 ! = 1 . 

Induction: Assume that the number of permutations on a set with n elements, n > 1, is n\. Furthermore, assume that \A\ = n + 1 
and that A contains an element called x. Let A' = A - {x}. We can reduce the definition of a permutation, /, on A to two steps. First, we select 
any one of the n\ permutations on A'. (Note the use of the induction hypothesis.) Call it g. This permutation almost completely defines a 
permutation on A by f(a) = g(a) for all a in A', Next, we select the image of x, which can be done n + 1 different ways. To keep our function 
bijective, we must adjust / as follows: If we select f(x) = y, then we must find the element, z, of A such that g(z) = y, and redefine the image of 
Z to f(z) = x. If we had selected f(x) = x, then there is really no adjustment needed. By the rule of products, the number of ways that we can 
define / is n! (n + 1) = (n + 1) ! ■ 

7.(a)/°g(n) = n + 3 (b) fin) = n + 15 (c) f°h(n) = n 1 + 5 

9. Theorem: If / : A -» B and / has an inverse, then that inverse is unique. 

Proof: Suppose that g and h are both inverses of/. 

g = g°iAg 
= g°(f°h) 
= (g°f)°h 
= i A °h 
= h =$ g = h # 

11. Proof of Theorem 7.3.2: Let x, x' be elements of A such that g°f(x) = g°f(x'); that is, g (f(x)) = g(f(x')). Since g is injective, f(x) = f(x') 
and since / is injective, x = x'.M 

Proof of Theorem 7.3.3: Let x be an element of C. We must show that there exists an element of A whose image under gof is x. Since g is 
surjective, there exists an element of B, y, such that g(y) = x. Also, since / is a surjection, there exists an element of A, z, such that f(z) = y, 
g°f(z)=g(f(z)) = g(y) = x.m 

13. Basis: (n = 2): (/ °/ 2 )-' = ./r 1 °/r 2 by exercise 10. 
Induction : Assume n > 2 and (/i °/ 2 ° • • • °f„T l = 



f,r [ °---°f2- [ °fr [ 



Consider (/ °/ 2 ° •••°/„ +1 ) 



l 

-l 



(/i°/2°---°/„ + ir' =((/r/2°--- o / ! )°/ !+ ir 1 

by the Basis = / B+I _I °(/ °/ 2 ° ••• °/„)~' 

by Induction hypothesis = f„+i~ [ °(/r' ° ' • • °fi "' °/i _1 ) 
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=/ n+ r 1 °-°/2- 1 °/r 1 ." 

15. Assume all functions are functions on A. 

(f°g)°h = f°(g°h) 

f°'lA =iA°f = f 

If/ -1 and g' 1 exist, (g°f)~ l = f' 1 °g~ [ and 

If/- 1 exists, (/-')-'=/. 

Supplementary Exercises— Chapter 7 

l.(a) Z (b) Z (c)/(-5) = 2|5 |+1 = 11 

(d) {1,3,5,7,9, ...) = the set of odd positive integers 

(e) No, a = 5 or a = -5. 

3. No. Relations (iii) and (iv) are not functions because the domain is not all of the set A. The others are not functions since in each case at least 
one element of A is mapped to 2 different elements. Example for relation (i), a is mapped to both a and b. 

5. (a) The matrix of/ can only have one 1 in each row. So if the domain of/ has n elements the matrix of/ will have n Is. 
(b) If/ is a bijection, besides having only one 1 in each row, there can only be one 1 in each column. 

7. (a) Let /(«) = n 2 , Vnefsl. Since / is a bijection from ^ into A = {n 2 | n e N\, N and A have the same cardinality; so A is countable. 

(b) Let B = {1 /n | n e P}. g : Rl -» fi defined by g(n) = 1 j(n + 1) is the required bijection. 

(c) That C = C\ U C 2 = {3, 9, 27, 81, . ..} U {2, 4, 8, 16, ...}is countable follows from the proof of Exercise 8. Without using this proof, we 

can still prove that C is countable by using the list 2 1 , 3 1 , 2 2 , 3 2 , 2 3 , 3 3 to define h : N -» C where h(a) =the number in position a + 1 in the list, 

9./:AxB->BxA defined by /(a , b) = Q>, a) is a bijection , which is all that we need to prove that 
|AxB| = |BxA| 

1 1 . This "code" can be viewed as a function, a, on the set of all finite sequences of letters. For example, a(hat) = qmh. This encoding function 
will not work very well because it is not a bijection. For example, no sequence with a or I in it is in the range. Although a is not one-to-one, it is 
difficult to find two English words with the same image. 

13. (a) 10 (a + 10) (b) a + 20 (c) 10a div 10 = a 

(d) (a + 10) div 10 = a div 10 + 1 

15. (a) /(£) = /> and /(c) = c 

(b) f(b) = a and f(c) = d 

( a b\ 
17. Since det\ = ad - be there are four permutations of [a, b, c, d) that leave the determinant invariant. These permutations are the 

identity function, tt] = {(a, d), (b, b), (c, c), (d, a)\, a 2 = {(a, d), (b, c), (c, b), (d, a)}, and or 3 = {(a, a), (b, c), (c, b), (d, d)}. 
19. (a) Domain = positive real numbers, Codomain = Real numbers. 

CHAPTER 8 

Section 8.1 

l.C(5,2)= C(4, 2) + C(4, 1) 

= C(3, 2) + C(3, 1) + C(3, 1) + C(3, 0) 

= C(3,2) + 2C(3, 1) + 1 

= C(2, 2) + C(2, 1) + 2 (C(2, 1) + C(2, 0)) + 1 

= 3C(2, l) + 4 

= 6 + 4= 10 

3. (a) p (x) in telescoping form: ((((x + 3) x - 15) x + 0) x + 1) x - 10 

(b) p(3) = ((((3 + 3) 3 - 15) 3 - 0) 3 + 1) 3 - 10 = 74 
5 . The basis is not reached in a finite number of steps if you try to compute f(x) for a nonzero value of x. 

Section 8.2 
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1. BasiS: B(0) = 3 -0 + 2 = 2, as defined 

Induction: Assume: B(k) = 3 k + 2 for some £ > 0. 

B(k + 1) = B(£) + 3 

= (3 k + 2) + 3 by the induction hypothesis 

= (3 k + 3) + 2 

= 3 (£ + 1) + 2, as desired. ■ 

3. Imagine drawing line k in one of the infinite regions that it passes through. That infinite region is divided into two infinite regions by line k. 
As line k is drawn through every one of the k - 1 previous lines, you enter another region that line k divides. Therefore, the number of regions 
is increased by k. 

5 . For n greater than zero, M(n) = M(n - 1) + 1 , and M(0) = 0. 

Section 8.3 

1. S(k) = 2 + 9 k 

3. S(k) = 6 (1/4)* 

5. S(k) = k 2 - 10 £ + 25 

7. S(k) = (3 + k) 5* 

9. S0t) = (12 + 3£) + (£ 2 +7£-22)2*- 1 

ll.P(fc) = 4(-3)* + 2*-5* +1 

1 3 . (a) The characteristic equation isaa 2 -a-l=0, which has solutions a = 1 1 + v5 I / 2 and /? = 1 1 - V5 I / 2 , It is useful to point out that 

a + B = 1 and a - B = y5 . The general solution is 

F(k) = bia k + b 2 B k . 
Using the initial conditions, we obtain the system: b\ + b 2 = 1 and b t a + b 2 R = 1 . The solution to this system is 

by = a/(a -B) = {5 + VT)/2 V5~ 
and fo 2 = /3/(a-/3) = (5- V5~)/2V5~ 
Therefore the final solution is 

w = (./VF)[((. + VT)/ 2 r-((.-VT)/2n 

(b)C r =F(r+l) 

15. (a) £>(«) = 2 D(n- 1)+ 1 forrc > 2and£>(l) = 0. 

(b) £>(«) = 2"- 1 - 1 

17. Solutions to the recurrence relation and its approximation are B(k) = (1 + c)* + (1 - c) k and B„(£) = 1. Note how as k increases, B(k) grows 
in size, while B a (k) stays constant. 

Section 8.4 

1 . (a) S(n) = 1/nl (c)U(k) = 1 /k, an improvement. 

(b) T(k) = (-3)* k\ , no improvement. 

3. (a) T(n) = 3 (Llog 2 n\ + 1) (c) V(n) = Llog 8 n\ + 1 

(b) Tin) = 2 
5. The indicated substitution yields S(n) = S(n+\). Since 5(0) = T(l)/T(Q) = 6, 5(n) = 6 for all n. Therefore T{n + 1) = 6 r(n) ^ T(n) = 6". 

7. (a) A good approximation to the solution of this recurrence relation is based on the following observation: n is a power of a power of two; 
that is, n is 2'", where m = 2 k , then Q(n) = 1 + (2(2'" /2 ). By applying this recurrence relation k times we obtain Q(n) = k. Going back to the 
original form of n, log 2 n = 2 k or log 2 (log 2 n) = k. We would expect that in general, Q(n) is [log 2 (log 2 n)\. We do not see any elementary 
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method for arriving at an exact solution. 

(b) Suppose that n is a positive integer with 2*~' <n<2 k . Then n can be written in binary form, (a k _ l a k _ 2 ■■■ a 2 a\ ao)t W o with a k-\ - 1 anc ' 
R(n) is equal to the sum 

*t-l 

2u (flu dk-i ' ' ' a ;)two 

If 2*~' < n < 2 k , then we can estimate this sum to be between 2 n - 1 and 2 n + 1 . Therefore, R(n) ~2n. 

Section 8.5 

l.(a) 1,0,0,0,0,... (b) 5(1/2)* (c) 1, 1, 0, 0, 0, ... 

(d) 3 (-2)* + 3-3* 

3. (a) 1/(1 -9z) (b)(2-10z)/(l-6z + 5z 2 ) 

(c) 1/(1 -z-Z 2 ) 

5. (a) 3/(1 -2z) + 2/(1 + 2z), 3-2* + 2(-2)* 

(b) 10/(1 -z) + 12/(2 -z), 10 + 6(1/2)* 

(c) -l/(l-5z) + 7/(l-6z), 7-6*-5* 

7. (a) 11 k 

(b)(5/3)k(k+ l)(2k + l) + 5k(k+ 1) 

k k k 

(c) Zo) do (* - /)) = io t Z j - io S / 

= 5 /t 2 (/t + 1) - (5 fc(* + 1) (2 * + l)/6) 
= (5/3)k(k+ l)(2k+ 1) 

(d) fc(*+ l)(2£ + 7)/12 

9. Coefficients of z° through z 5 in ( 1 + 5 z) (2 + 4 z) (3 + 3 z) (4 + 2 z) (5 + z) 

fc Number of ways of getting a score of fc 

120 

1 1044 

2 2724 

3 2724 

4 1044 

5 120 

Supplementary Exercises— Chapter 8 

1. Let v(n) be the quantity in question. Since any positive digit can appear in a one-digit positive integer, v(l) = 9. Given an n digit number, 
n > 2, it can be thought of as an n - 1 digit number times ten plus a digit. This digit cannot be the same as the units digit of the n - 1 digit 
number. Therefore, by the product rule v(n) = 9 v(n - 1) for n > 2. 

3 . (a) To execute Split with L in = ( 1 , 2 , 3 , 4) , we must split the list into LI = ( 1 , 3) and L 2 = (2, 4) . If you carefully examine the algorithm for 
a list of length 2, you will see that the output equals the input; therefore L 1 out = (1, 3) and L 2 out = (2, 4) and Lout = (1, 3, 2, 4). 
(b) Examine the results for r = 1, 2, 3 with numbers in binary form. Notice the symmetry with respect to the vertical line. 



L in L out 



1 1 



L in L out 

r=\ ~ " ~ ~" 000 000 

001 100 

010 010 

r = 3 011 110 

100 001 

101 101 

110 011 

111 111 



L in L out 
00 00 
r = 2 01 10 

10 01 

11 11 

The integers in L out are sorted so that (fo r _i b r - 2 • • • &o)two appears in position (bo b\ ■■■ b r -i) tv/0 . 
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5. This is not a closed form expression because the number of operations that are needed to compute the expression grows with n, B (n) in this 
form requires n additions and n — 1 multiplications . 

7. Kathryn's balance on her first birthday is $1 = 6(1). If B (n) is her balance on her nth birthday, n > 2, then B{n) =1.1 B(n - 1) + n. 

B(n) = 6<">(n) + B ( P\n) = b\(\l) n - (10 n + 1 10) 
B(l) = 1 =>(l.l)6i = 121 =>£>! = 1.1 

Therefore B{n) = 121 (LI)"" 1 - (10 n+ 1 10). On her 21st birthday, Kathryn will have 5(21) = 121 (1 .l) 20 - (210 + 1 10) = $494 .03. 

9. (a) If it takes X (n) moves to move n disks to peg 2, then we can transfer the n + 1 disk to peg 3 in one move and then transfer the n disks 
from peg 2 to peg 3 in X(n) moves, so X(n + 1) = X(n) + 1 + X(n) = 2 X(n) + 1 , or equivalently X(ri) = 2 X(n - 1) + 1 . 

(b) X(n) = b\ -2" - 1 . Since it takes 1 move to transfer 1 disk from one peg to another, Z(l) = 1 ; SO b\ = 1 and X(n) = 2" - 1 . We verify that 
X(3) = 7: 



St4JlL 1 

2 
i 






-*(4) 








2 


3 



-<L) 



— W 



2 

1 




1 


1 


3 


J 



-<"2) 



-(6) 



j : i 

2 
1 3 



K3> 



-1.7) 



11. The solution for n = A k is (2(4*) = - (4* +l - l), This can be obtained in one of two ways. Either use the substitution S(k) = 2(4*), which 
yields S(k) = 4* + S(k - 1), or note that 2(4*') = 4* + 2(4*"') = 4* + 4*" 1 + 2(4*~ 2 ) = 4* + 4*~' + ■ ■ ■ + 4 + 1 . This finite geometric series has the 
closed form expression above. By similar means , 2(2-4*) = 2 g(4*) = - (4* +1 - l) 

13. G(S; z) = 1 +z + 2z 2 +4z 3 + 8z 4 ■•• 
15. G(T; z) = G(S; cz) = 2 S(k) {czf = 

= I*(S(k)c k )z k 

k=0 

Therefore, T(k) = S(k)c k . 
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| CHAPTER 9 



Section 9.1 

1. In Figure 9.1.2, computer b can communicate with all other computers. In Figure 9.1.3, there are roads to and from city b to all other cities. 
In Figure 9.1 .4, there is a door connecting room b to every other room in the house (including the outside). 

3. 




5. No, the maximum number of edges would be = 28. 

b 2 



7. (a) C(n, 2) = ^ 

(b) n - 1 , one edge for each vertex except the champion vertex. 

Section 9.2 
A Exercises 

1 . Estimate the number of vertices and edges in each of the following graphs. Would the graph be considered sparse? 

(a) A rough estimate of the number of vertices in the "world airline graph" would be the number of cities with population greater than or equal 
to 100,000. Using the Wolfram CityData function we can get this number: 

CityData[All] // Select [B, CityData [B, "Population"] >= 10 ~ 5 &] & // Length 

4257 

There are many smaller cities that have airports, but some of the metropolitan areas with clusters of large cities are served by only a few 
airports. 4000-5000 is probably a good guess. As for edges, that's a bit more difficult to estimate. It's certainly not a complete graph. 
Looking at some medium sized airports such as Manchester, NH, the average number of cities that you can go to directly is in the 50-100 range. 

So a very rough estimate would be — = 168 750. 



(b) The number of ASCII characters is 128. Each character would be connected to 8 others and so there are 



: 512 edges. 



(c) The Oxford English Dictionary as approximately a half- million words, although many are obsolete. Mathematica has a words database 
that is a bit less comprehensive, yet it isn't trivial. The nice thing about using the Wolfram data is that the number of vertices and edges can be 
counted. Here is the number of words and hence the number of vertices in the graph: 

WordData[All] //Length 
149191 

And here are the number of edges, determined in a not necessarily efficient manner. 

(Length [WordData [8 ~~ , "Lookup"]] & /@ WordData [All] ) // Apply [Plus, 8] & 

876547 

The last output took about three hours using a MacBook Pro with a 2.5 GHz Intel Core 2 Duo Processor. It should also be pointed out that 
Wolfram's choice of "words" doesn't match the OED or a Scrabble player's dictionary. For example, "1900s" and "18-karat gold" are 
included among the list of words. Nevertheless, the number we have here are good ballpark estimates for most most interpretations of the 
"English words." 

3. Each graph is isomorphic to itself. In addition, G? and G4 are isomorphic; and G3, G 5 , and G(, are isomorphic. 

Section 9.3 
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k 12 3 4 5 6 

V[k] .found T T T F F T 

1 - unit o c a « (* = undefined) 

^[fcj.from 2 5 6**5 

Depth Set 2 12**1 

3. If the number of vertices is n, there can be — vertices with one vertex not connected to any of the others. One more edge and connectiv- 
ity is assured. 

5 . Basis: (k = 1) Is the relation ^'defined by vr 1 w if there is a path of length I from v to w? Yes, since vrw if and only if an edge, which is a path 
of length I, connects v to w. 

Induction: Assume that vr k w if and only if there is a path of length k from v to w. We must show that vr k+[ w if and only if there is a path of 
length k + 1 from v to w. 

vr k +' w => vr k y and yrw, for some vertex y. By the induction hypothesis, there is a path of length k from v to y. And by the basis, there is 
a path of length one from y to w. If we combine these two paths, we obtain a path of length k + 1 from v to w. Of course, if we start with a path 
of length k + 1 from v to w, we have a path of length k from v to some vertex y and a path of length / from y to w. Therefore, 

vr k y and yrw => vr* + ' w m 

Section 9.4 

1. Using a recent road map, it appears that a Eulerian circuit exists in New York City, not including the small islands that belong to the city. 
Lowell, Massachusetts, is located at the confluence of the Merrimack and Concord rivers and has several canals flowing through it. No Eulerian 
path exists for Lowell. 

3. Gray Code for the 4-cube: 

(0000 
0001 
0011 
0010 
0110 
0111 
0101 
0100 
1100 
1101 

1111 

1110 
1010 
1011 
1001 
I 1000 ) 

5. Any bridge between two land masses will be sufficient. To get a Eulerian circuit, you must add a second bridge that connects the two land 
masses that were not connected by the first bridge. 

7. Theorem: Let G = (V, EI) be a directed graph, G has a Eulerian circuit if (a) G is connected and (b) indeg(v) = outdeg(v) for all v in V. 
There exists a Eulerian path from v t tov 2 if (a) g is connected and (b) indeg(vi) = outdeg(vi) - 1, indeg(v2) = outdeg(v2) + 1 and for all other 
vertices in V the indegree and outdegree are equal. 

9. A round-robin tournament graph is rarely Eulerian. It will be Eulerian if it has an odd number of vertices and each vertex (team) wins exactly 
as many times as it loses. Every round-robin tournament graph has a Hamiltonian path. This can be proven by induction on the number of 
vertices . 

Section 9.5 

1. The circuit would be Boston, Providence, Hartford, Concord, Montpelier, Augusta, Boston. It does matter where you start. If you start in 
Concord, for example, your mileage will be higher. 

3. (a) Optimal cost =2^2". 

Phase 1 cost =2.4 VT . 

Phase 2 cost = 2.6 VT . 
(b) Optimal cost =2.60. 
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Phase 1 cost =3.00. 

Phase 2 cost 2 y/T . 

(c) A = (0.0, 0.5), B = (0.5, 0.0), C = (0.5, 1.0), D = (1.0, 0.5) 

There are 4 points; so we will divide the unit square into two strips. 

Optimal Path: (£, A, C, D) Distance = 2 a/2~ 

Phase I Path: (B, A, C, D) Distance = 2 V2~ 

Phase II Path: (A, C, B, D) Distance = 2 + V2~ 

(d) A = (0, 0), B = (0.2, 0.6), C = (0.4, 0.1), D = (0.6, 0.8), E = (0.7, 0.5) 

There are 5 points; so we will divide the unit square into three strips. 

Optimal Path: (A, B, D, E, C) Distance = 2.31 

Phase I Path: (A, C, B, C, E) Distance = 2.57 

Phase II Path: (A, B, D, E, C) Distance = 2.31 

5. (a) f{c, d) = 2, f{b, d) = 2, f(d, k) = 5, f{a, g) = 1, and/(g, k) = 1. 

(b) There are three possible flow-augmenting paths. 

s, b, d, k with flow increase of 1 . 
s, a, d, k with flow increase of 1 , and 
s, a, g, k with flow increase of 2. 

(c) The new flow is never maximal, since another flow-augmenting path will always exist. For example, if s, b, d, k is used above, the new 
flow can be augmented by 2 units with s, a, g, k. 

I . (a) Value of maximal flow =31. 

(b) Value of maximal flow =14. 

(c) Value of maximal flow =14. 
One way of obtaining this flow is: 

Step Flow - Augmenting Path Flow Added 

1 Source, A, Sink 2 

2 Source, C, B, Sink 3 

3 Source, E, D, Sink 4 

4 Source, A, B, Sink 1 

5 Source, C, D, Sink 2 

6 Source, A, B, C, D, Sink 2 

9. To locate the closest neighbor among the list of k other points on the unit square requires a time proportional to k. Therefore the time required 
for the closest-neighbor algorithm with n points is proportional to (n - 1) + (n - 2) + ••• + 2 + 1, which is proportional to n 2 . Since the strip 
algorithm takes a time proportional to n (log n), it is much faster for large values of n. 

II. LetP = Pi, P 2 , . .., P 2 „ be a set of points in the unit square. If 5 is a subset of P, define min(S) to be the point in S with smallest x coordi- 
nate. If there is a tie, select the point with smallest y coordinate. 

Matching Algorithm: 

1. S:=P 

2. While S * Do 

2.1 v:=min(5) 

2.2 w := closest point to v in S - {v) 

2.3 pair up v and w 

2.4 S:=S-{v, w] 

Although this could be classified as a closest-neighbor algorithm, there is a better one, but it is more time-consuming. 
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Section 9.6 

1 . Theorem 9.6.2 can be applied to infer that if n s 5 , then K„ is nonplanar. A Ka, is the largest complete planar graph. 

3. (a) 3(b) 3 (c) 3 (d) 3 (e) 2 (f) 4 

5.n 

1 . Suppose that G' is not connected. Then G' is made up of 2 components that are planar graphs with less than k edges, G\ and Gi- For i = 1 and 

2. let Vj, rj, and e, be the number of vertices , regions and edges in G;. 

By the induction hypothesis: 

Vj + )-j - e\ = 2 

and V2 + r% - ei = 2 

One of the regions, the infinite one, is common to both graphs. Therefore, when we add edge e back to the graph, we have 
r = r\ + )'2 - 1 , v = V] + V2, and e = e\ + £2 + 1 ■ 

v + r - e + (vi + v 2 ) + ()'i + r 2 - 1) - (ej + e 2 + 1) 
= (vi + n - ei) + (v 2 + )"2 - £2) - 2 

= 2 + 2-2 
= 2 ■ 

9. Since |£| + E L '= — — , either EotE c has at least — — elements. Assume that it is E that is larger. Since — — is greater than 
3 n - 6 for n & 1 1 , G would be nonplanar. Of course, if E c is larger, then G' would be nonplanar by the same reasoning. 

11. Suppose that (V, E) is bipartite (with colors red and blue), \E\ is odd, and (vi, v 2 , ..., v 2 „ + i, Vi) is a Hamiltonian circuit. If vi is red, then 
v 2n+i would also be red. But then (v2„ + i , Vi ) would not be in E, a contradiction. 

13. Draw a graph with one vertex for each edge, If two edges in the original graph meet at the same vertex, then draw an edge connecting the 
corresponding vertices in the new graph. 

Supplementary Exercises— Chapter 9 

1. Graphs G\ and G2 are isomorphic. One isomorphism between them is {(a, e), (b, h), (c, /), (d, g)}. To see that G3 is not isomorphic to the 
other two notice that k and 7 are not connected by an edge while in Gi and G2 every pair of vertices is connected. 

3. (a) {a, e) is a maximal independent set in Figure 9.1.2. 

(b) (By contradiction) Assume that W is a maximal independent set in G. If V is not connected to any vertex, W [J {v} is independent, and since 
this is a larger set, Wis not maximal. 

(c) A single vertex is maximal; no larger set can be independent. 
5. (a) 




Guatemala ^ 

r — Honduras 



ElSalvador 

Nicaragua 



CostaRica 



(b) (Mexico, Guatemala, Belize, Nicaragua, CostaRica, Panama) 

(c) This path could be a list of the countries that you would go through in your trip. 

7. (a) If one source s exists, then (s, v) is on the edge of the round-robin tournament graph for each vertex v different from s. Therefore no other 
vertex could be a source. By similar reasoning, only one sink can exist. In a round-robin tournament, only one team can be unbeaten and only 

Applied Discrete Structures by A. Doerr & K. Levasseur is licensed under a Creative Commons Attribution- NonCommercial-ShareAlike 3.0 United States 



285 



Solutions to Odd Numbered Exercises 



285 



one can be winless. 

(b) If |V| = n, outdeg (source) = indeg (sink) 



n-1 



(c) Let V = {vi , V2, . . ., v„}. The following graph demonstrates that p /\ "■ <? is possible. Similar graphs can be drawn for the other situations. 




k 123456789 

V[£].name abcdefgh i 
9. V[fe] .found TTTTTTTTT 

V[fe].from 4 15 5 13 5 5 6 

depth set 3 12 2 13 2 2 4 
1 1 . Gi is randomly Eulerian from no vertex, yet it is Eulerian. 

G2 is randomly Eulerian from only vertex 1 . 

G3 is randomly Eulerian from only vertices 1 and 2. 

G4 is randomly Eulerian from every vertex. 
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c 2 






c 4 



13. Addition of edges to E will certainly not decrease the degrees of each vertex. After adding some edges to E until no more can be added 
without allowing a Hamiltonian circuit, select e = {vi, v„} not in the new, larger E. Since a Hamiltonian circuit exists in (G, E [J \e\), there is a 
path in G that visits every vertex in the order Vi, V2, ...V„. Now for 2 < i < n, if {vi, v,j e E, then 



Not both of the thicker red edges can be in E. 




{v;_i, v„} £E, for otherwise, (vi, V2, ..., Vj_i, v„, v„, v n _i, .,., v,-, vi) is a Hamiltonian circuit. 

Since {vi , v,j e£=> {v,_i , v„} £ i? <=> -1 ({vi , v,} e E and {v,_i , v„) e £), no more than n - 1 of the possible edges that connect Vj and v„ to other 
vertices could be in E, even after adding edges to E. Therefore, for the original graph, with {vj, v„] £ E, deg Vj + deg v„ <n,a contradiction. 

15. (a) f{b, d) = f(c, d) = f(a, g) = f(y, t) = 1, f(d, t) = 2, V(f) = 3. 

(b) One flow-augmenting path is (s, a, g, t), which increased the flow value by 1, to 4. (A second one is (s, b, d, a, g, /)■) 

(c) The new flow is maximal since its value is equal to the sum of capacities into the sink. 

17.(a) (A, D, F, E, C, B, A) 

(b) Starting at any city, it would take n — 2 seconds to decide where to go first. Then it would take n - 3 seconds from the next step, and so on. 
The total time would be 



in - 2) + (n - 3) + — + 2 + 1 + = 



2(n-2)(rc-l) 



■ seconds 



seconds, when n is large. 
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19. 




| CHAPTER 10 



Section 10.1 

1 . The number of trees are: (a) 1 , (b) 3, and (c) 16. The trees that connect V c are: 

N [n~| [N~| 



E W E W 



E W E W 



N 



W 



E W 



E W 



E W 



S 



s s 

3. Hint: Use induction on \E\. 
5. (a) Assume that (V ', E) is a tree with \V\ > 2, and all but possibly one vertex in Vhas degree two or more. 
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2\E\ = Zdeg(v)>2\V\-l 
veV 

or |£|>|V| - -=>|£|>|V|=>(V, £) is not a tree. 

(b) The proof of this part is similar to part a in that we get 2 \E\ > 2 | V\ - 1 , since a tree that is not a chain has a vertex with degree three or more. 

Section 10.2 

1 . It might not be most economical with respect to Objective 1 . You should be able to find an example to illustrate this claim. The new system 
can always be made most economical with respect to Objective 2 if the old system were designed with that objective in mind. 

3. In the figure below, {1, 2) is not a minimal bridge between L = (1, 4)andR = {2, 3), but it is part of the minimal spanning tree for this graph. 




5, (a) Edges in one solution are: 

{8, 7), {8. 9), (8, 13}. (7, 6). {9. 4), {13. 12), (13, 14), {6, 11), {6, 1), (1,2), {4, 3), {4, 5) {14, 15), and {5, 10) 

(b) Vertices 8 and 9 are at the center of the graph. Starting from vertex 8, a minimum diameter spanning tree is 
{{8, 3), {8, 7), {8, 13), {8, 14), {8, 9), {3, 2), {3, 4), {7, 6), {13, 12), {13, 19), {14, 15), {9, 16), 

{9, 10), {6, 1), {12, 18), {16, 20), {16, 17), {10, 11), {20, 21), {11, 5)). The diameter of the tree is 7. 

Section 10.3 

1. Locate any simple path of length d and locate the vertex in position [rf/2]on the path. The tree rooted at that vertex will have a depth of 
|"d/2~|, which is minimal. 

3. 
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Section 10.4 

l. 



(a) 



(b) 





(d) 



(O 





(e) 






Preorder 


Inorder 


Postorder 


(a) 


■a + be 


a-b + c 


abc + ■ 


(b) 


+ -abc 


a-b + c 


abc + 


(c) 


+-ab-ac 


a-b + a-c 


ab-ac-+ 
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(a) 




(b) 




7. Solution #1: 

Basis: A binary tree consisting of a single vertex, which is a leaf, satisfies the equation leaves = internal vertices + 1 , 

Induction: Assume that for some k > 1 , all full binary trees with k or fewer vertices have one more leaf than internal vertices. Now consider 

any full binary tree with k + 1 vertices. Let T A and T B be the left and right subtrees of the tree which, by the definition of a full binary tree, must 
both be full. If i^and/^ are the numbers of internal vertices in T A a.ndT B , and j A andj B are the numbers of leaves, then 
Ja = 'A + 1 and js = ia + 1 ■ Therefore, in the whole tree, the number of leaves = j A + Jb 

= (i A + 1) + (i B + 1) 

= (i A + i B + 1) + 1 

= (number of internal vertices) + 1 

Solution #2: Imagine building a full binary tree starting with a single vertex. By continuing to add leaves in pairs so that the tree stays full, 
we can build any full binary tree. Our starting tree satisfies the condition that the number of leaves (1) is one more than the number of internal 
vertices (0). By adding a pair of leaves to a full binary tree, an old leaf becomes an internal vertex, increasing the number of internal vertices by 
one. Although we lose a leaf, the two added leaves create a net increase of one leaf. Therefore, the desired equality is maintained. 

Supplementary Exercises— Chapter 10 

1 . Each of the n — 1 edges of a tree contributes to the degrees of two vertices . Therefore the sum of all degrees of vertices in an n vertex tree is 

2(n-l) = 2n-2. 

3. (a) Gi is graceful : v, = 1 , v 2 + 2, v 3 = 4 

G4 is graceful : v t =2, v 2 = 1 , V3 = 3, V4 = 4 

(b) Starting at either end of the chain label the first vertex 5(1) = 1 and the (k + 1) st vertex, k > 1 , S(k + 1) = S(k) + k. The edge connecting the 
kth and (A: + 1) st vertex is the kth edge and since (S{ k + 1) - s(k)) = k, the chain is graceful. The closed form expression for 

S(k)isl+(k(k- 1 -)). 

5. First, {3, 6} is added to the edge set, then {1, 2) and (3, 4). Then (4, 6} is rejected since it would complete a cycle. This can be seen from the 
forest. 





Vertices 4 and 6 have the same root in this tree; hence (4, 6) is rejected. (1,5) and {2, 3) are the final edges that complete the minimal spanning 
tree. Notice that (4, 6) could have been the second edge selected. In that case, (3, 4) would be rejected. 

7. The depth of the tree is four. 
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9. (a) 





(b) aa ■ 2 a ■ b ■ +b + is the postorder traversal of the tree. This is also the postfix version of the original expression. 
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